# SambaNova

> High-throughput inference on custom RDU hardware: model specs, pricing, and capabilities

<Note>Source: [SambaNova model docs](https://docs.sambanova.ai/docs/en/get-started/overview). All model IDs use the prefix `sambanova/`. These models run on SambaNova RDU (Reconfigurable Dataflow Unit) chips, which are designed for large-scale AI inference.</Note>

<Warning>SambaNova models are not yet available through the Timbal platform proxy. A `SAMBANOVA_API_KEY` is required to use these models. If you'd like to access SambaNova models via your Timbal API key, please [contact sales](https://timbal.ai).</Warning>
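
The two requirements above (the `sambanova/` prefix and the `SAMBANOVA_API_KEY` variable) can be checked before any request is made. The following is a minimal sketch, not part of Timbal or the SambaNova SDK: the helper names `sambanova_model_id` and `require_sambanova_key` are hypothetical, and only the prefix and the environment variable name come from this page.

```python
import os

# Prefix shared by every model ID on this page.
SAMBANOVA_PREFIX = "sambanova/"


def sambanova_model_id(name: str) -> str:
    """Return the model ID with the `sambanova/` prefix, adding it if missing.

    Hypothetical helper for illustration only.
    """
    name = name.strip()
    if not name:
        raise ValueError("model name must not be empty")
    if name.startswith(SAMBANOVA_PREFIX):
        return name
    return SAMBANOVA_PREFIX + name


def require_sambanova_key(env=None) -> str:
    """Return SAMBANOVA_API_KEY, or raise if it is unset or empty.

    `env` defaults to os.environ; pass a dict to make the check testable.
    """
    env = os.environ if env is None else env
    key = env.get("SAMBANOVA_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "SAMBANOVA_API_KEY is not set; SambaNova models need their own key."
        )
    return key


if __name__ == "__main__":
    print(sambanova_model_id("DeepSeek-V3.1"))  # prints sambanova/DeepSeek-V3.1
```

Running the check at startup turns a missing key into one clear error instead of a failed request later on.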

## DeepSeek

<CardGroup cols={2}>
  <Card title="DeepSeek-V3.2">
    <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> Reasoning · <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> Speed

    `sambanova/DeepSeek-V3.2`

    DeepSeek V3.2 running on SambaNova RDU hardware.

    * \$3.00 / \$4.50
    * <Icon icon="window-maximize" size={14} /> 8K context
    * <Icon icon="keyboard" size={14} /> Text input
  </Card>

  <Card title="DeepSeek-V3.1">
    <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> Reasoning · <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> Speed

    `sambanova/DeepSeek-V3.1`

    DeepSeek V3.1 running on SambaNova RDU hardware.

    * \$3.00 / \$4.50
    * <Icon icon="window-maximize" size={14} /> 128K context
    * <Icon icon="keyboard" size={14} /> Text input
  </Card>
</CardGroup>

## Meta Llama

<CardGroup cols={2}>
  <Card title="Llama-4-Maverick-17B-128E-Instruct">
    <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> Reasoning · <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> Speed

    `sambanova/Llama-4-Maverick-17B-128E-Instruct`

    Meta Llama 4 Maverick with 17B active parameters across 128 experts. Supports image input.

    * \$0.63 / \$1.80
    * <Icon icon="window-maximize" size={14} /> 128K context
    * <Icon icon="keyboard" size={14} /> Text, Image input
  </Card>

  <Card title="Meta-Llama-3.3-70B-Instruct">
    <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> Reasoning · <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> Speed

    `sambanova/Meta-Llama-3.3-70B-Instruct`

    Meta Llama 3.3 70B instruction-tuned model running on SambaNova RDU hardware.

    * \$0.60 / \$1.20
    * <Icon icon="window-maximize" size={14} /> 128K context
    * <Icon icon="keyboard" size={14} /> Text input
  </Card>
</CardGroup>

## Google Gemma

<CardGroup cols={2}>
  <Card title="gemma-4-31B-it">
    <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> Reasoning · <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> Speed

    `sambanova/gemma-4-31B-it`

    Google Gemma 4 31B instruction-tuned model on SambaNova RDU hardware.

    * \$0.40 / \$0.80
    * <Icon icon="window-maximize" size={14} /> 128K context
    * <Icon icon="keyboard" size={14} /> Text, Image input
  </Card>

  <Card title="gemma-3-12b-it">
    <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> Reasoning · <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> Speed

    `sambanova/gemma-3-12b-it`

    Google Gemma 3 12B instruction-tuned model on SambaNova RDU hardware.

    * \$0.10 / \$0.20
    * <Icon icon="window-maximize" size={14} /> 128K context
    * <Icon icon="keyboard" size={14} /> Text input
  </Card>
</CardGroup>

## OpenAI OSS / MiniMax

<CardGroup cols={2}>
  <Card title="gpt-oss-120b">
    <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> Reasoning · <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> Speed

    `sambanova/gpt-oss-120b`

    OpenAI's open-weight 120B MoE model on SambaNova RDU hardware. OpenAI reports near-parity with o4-mini on core reasoning benchmarks. Released under the Apache 2.0 license.

    * \$0.22 / \$0.59
    * <Icon icon="window-maximize" size={14} /> 128K context
    * <Icon icon="keyboard" size={14} /> Text input
  </Card>

  <Card title="MiniMax-M2.7">
    <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> <Icon icon="lightbulb" size={14} /> Reasoning · <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> <Icon icon="bolt" size={14} /> Speed

    `sambanova/MiniMax-M2.7`

    MiniMax M2.7 large-scale MoE running on SambaNova RDU hardware.

    * \$0.30 / \$1.20
    * <Icon icon="window-maximize" size={14} /> 128K context
    * <Icon icon="keyboard" size={14} /> Text input
  </Card>
</CardGroup>
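
To compare the price pairs on the cards, a rough per-request cost can be sketched as below. It assumes each pair is USD per 1M input tokens followed by USD per 1M output tokens; treat that as an assumption and confirm it against SambaNova's pricing before relying on the numbers. The `PRICES` table copies a few entries from the cards above, and `estimate_cost` is a hypothetical helper.

```python
# Assumption: each pair is (USD per 1M input tokens, USD per 1M output tokens).
# Values are copied from the cards on this page.
PRICES = {
    "sambanova/DeepSeek-V3.1": (3.00, 4.50),
    "sambanova/Meta-Llama-3.3-70B-Instruct": (0.60, 1.20),
    "sambanova/gpt-oss-120b": (0.22, 0.59),
    "sambanova/gemma-3-12b-it": (0.10, 0.20),
}

TOKENS_PER_PRICE_UNIT = 1_000_000


def estimate_cost(model_id: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request under the pricing assumption above."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if model_id not in PRICES:
        raise KeyError(f"no price listed for {model_id!r}")
    input_price, output_price = PRICES[model_id]
    total = input_tokens * input_price + output_tokens * output_price
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # Rank the listed models by estimated cost for the same workload.
    for model_id in sorted(PRICES, key=lambda m: estimate_cost(m, 200_000, 50_000)):
        print(f"{model_id}: ${estimate_cost(model_id, 200_000, 50_000):.4f}")
```

Under that assumption, 1M input tokens plus 1M output tokens on `sambanova/DeepSeek-V3.1` would come to \$7.50.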
