> ## Documentation Index
> Fetch the complete documentation index at: https://docs.timbal.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Cerebras

> Wafer-scale inference at world-record token speeds with specs, pricing, and capabilities

<Note>Source: [Cerebras model docs](https://inference-docs.cerebras.ai/introduction). All model IDs use the prefix `cerebras/`. Powered by Cerebras wafer-scale chips — the world's largest AI accelerator — delivering up to 3000+ tokens/second.</Note>

<Warning>Cerebras models are not yet available through the Timbal platform proxy. A `CEREBRAS_API_KEY` is required to use these models. If you'd like to access Cerebras models via your Timbal API key, please [contact sales](https://timbal.ai).</Warning>

## All Models

<CardGroup cols={2}>
  <Card title="gpt-oss-120b">
    <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> Reasoning · <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> Speed

    `cerebras/gpt-oss-120b`

    OpenAI's open-weight MoE model with 120B total parameters (5.1B active per token), running at up to 3000 tokens/s on Cerebras wafer-scale hardware. Near-parity with o4-mini on reasoning benchmarks. Supports extended thinking. Apache 2.0.

    * \$0.35 / \$0.75
    * <Icon icon="window-maximize" size={14} /> 128K context
    * <Icon icon="keyboard" size={14} /> Text input
    * <Icon icon="brain" size={14} /> Thinking
    * <Icon icon="calendar" size={14} /> Knowledge cutoff Jun 2024
  </Card>

  <Card title="zai-glm-4.7">
    <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> Reasoning · <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> Speed

    `cerebras/zai-glm-4.7`

    ZAI GLM 4.7 with 355B parameters, running at \~1000 tokens/s on Cerebras hardware. Strong multilingual reasoning and instruction-following capabilities.

    * \$2.25 / \$2.75
    * <Icon icon="window-maximize" size={14} /> 128K context
    * <Icon icon="keyboard" size={14} /> Text input
    * <Icon icon="calendar" size={14} /> Knowledge cutoff \~early 2025
  </Card>
</CardGroup>
