Source: SambaNova model docs. All model IDs use the prefix
sambanova/. Powered by SambaNova RDU (Reconfigurable Dataflow Unit) chips designed for large-scale AI inference.DeepSeek
DeepSeek-R1-0528
Reasoning · Speed
sambanova/DeepSeek-R1-0528DeepSeek’s latest R1 reasoning model running on SambaNova RDU hardware. Frontier-class chain-of-thought reasoning with extended thinking support.- $5.00 / $7.00
- 128K context
- Text input
- Thinking
- Knowledge cutoff ~early 2025
DeepSeek-V3.2
Reasoning · Speed
sambanova/DeepSeek-V3.2DeepSeek V3.2 preview running on SambaNova RDU hardware. Latest iteration of the V3 series with further capability improvements.- $3.00 / $4.50
- 8K context
- Text input
- Knowledge cutoff ~early 2025
DeepSeek-V3.1
Reasoning · Speed
sambanova/DeepSeek-V3.1DeepSeek V3.1 running on SambaNova RDU hardware. Updated general-purpose model with improved instruction-following and coding capabilities.- $3.00 / $4.50
- 128K context
- Text input
- Knowledge cutoff ~early 2025
DeepSeek-V3.1-Terminus
Reasoning · Speed
sambanova/DeepSeek-V3.1-TerminusDeepSeek V3.1 Terminus preview running on SambaNova RDU hardware. Preview release with enhanced capabilities for complex tasks.- $3.00 / $4.50
- 128K context
- Text input
- Knowledge cutoff ~early 2025
DeepSeek-V3.1-cb
Reasoning · Speed
sambanova/DeepSeek-V3.1-cbDeepSeek V3.1 community tier running on SambaNova RDU hardware at significantly reduced pricing. Ideal for high-volume workloads.- $0.15 / $0.75
- 128K context
- Text input
- Knowledge cutoff ~early 2025
DeepSeek-V3-0324
Reasoning · Speed
sambanova/DeepSeek-V3-0324DeepSeek V3 March 2024 release running on SambaNova RDU hardware. Strong general-purpose model for coding, reasoning, and instruction-following.- $3.00 / $4.50
- 128K context
- Text input
- Knowledge cutoff ~early 2024
Qwen
Qwen3-235B-A22B-Instruct-2507
Reasoning · Speed
sambanova/Qwen3-235B-A22B-Instruct-2507Qwen3 235B MoE model with 22B active parameters running on SambaNova RDU hardware. Supports hybrid thinking mode with strong multilingual and agentic capabilities.- $0.40 / $0.80
- 64K context
- Text input
- Hybrid thinking
- Knowledge cutoff ~early 2025
Qwen3-32B
Reasoning · Speed
sambanova/Qwen3-32BQwen3 32B dense model running on SambaNova RDU hardware. Supports hybrid thinking mode with strong multilingual capabilities.- $0.40 / $0.80
- 32K context
- Text input
- Hybrid thinking
- Knowledge cutoff ~early 2025
Meta Llama
Llama-4-Maverick-17B-128E-Instruct
Reasoning · Speed
sambanova/Llama-4-Maverick-17B-128E-InstructMeta Llama 4 Maverick preview with 17B active parameters across 128 experts running on SambaNova RDU hardware. Supports image input.- $0.63 / $1.80
- 128K context
- Text, Image input
- Knowledge cutoff ~early 2025
Meta-Llama-3.3-70B-Instruct
Reasoning · Speed
sambanova/Meta-Llama-3.3-70B-InstructMeta Llama 3.3 70B instruction-tuned model running on SambaNova RDU hardware. Multilingual, strong at coding and reasoning tasks.- $0.60 / $1.20
- 128K context
- Text input
- Knowledge cutoff Dec 2023
Llama-3.3-Swallow-70B-Instruct-v0.4
Reasoning · Speed
sambanova/Llama-3.3-Swallow-70B-Instruct-v0.4Japanese-tuned Llama 3.3 70B by TokyoTech-LLM running on SambaNova RDU hardware. Optimized for Japanese language tasks while retaining strong English capabilities.- $0.60 / $1.20
- 16K context
- Text input
- Knowledge cutoff Dec 2023
Meta-Llama-3.1-8B-Instruct
Reasoning · Speed
sambanova/Meta-Llama-3.1-8B-InstructMeta Llama 3.1 8B instruction-tuned model running on SambaNova RDU hardware. The most affordable and fastest entry point on SambaNova.- $0.10 / $0.20
- 16K context
- Text input
- Knowledge cutoff Jul 2024
OpenAI OSS
gpt-oss-120b
Reasoning · Speed
sambanova/gpt-oss-120bOpenAI’s open-weight MoE model with 120B total parameters running on SambaNova RDU hardware. Near-parity with o4-mini on reasoning benchmarks. Apache 2.0.- $0.22 / $0.59
- 128K context
- Text input
- Thinking
- Knowledge cutoff Jun 2024
MiniMax
MiniMax-M2.5
Reasoning · Speed
sambanova/MiniMax-M2.5MiniMax M2.5 large-scale model running on SambaNova RDU hardware with an industry-leading 160K context window.- $0.30 / $1.20
- 160K context
- Text input
- Knowledge cutoff ~early 2025