Skip to main content
Source: Fireworks model docs. All model IDs use the prefix fireworks/accounts/fireworks/models/. Only serverless models are listed here — other Fireworks models require a dedicated deployment.

All Models

deepseek-v4-pro

Reasoning · Speedfireworks/accounts/fireworks/models/deepseek-v4-proDeepSeek V4 Pro on Fireworks serverless: frontier open MoE for coding, reasoning, and up to ~1M token context with function calling.
  • $1.74 / $3.48
  • 1M context
  • Text input

qwen3p6-plus

Reasoning · Speedfireworks/accounts/fireworks/models/qwen3p6-plusQwen 3.6 multimodal plus tier on Fireworks for vision-language and general agent tasks.
  • $0.50 / $3.00
  • 262K context
  • Text, Image input

kimi-k2p6

Reasoning · Speedfireworks/accounts/fireworks/models/kimi-k2p6Moonshot Kimi K2.6 on Fireworks: multimodal MoE for high-quality tool use and long-context workloads.
  • $0.95 / $4.00
  • 262K context
  • Text, Image input

kimi-k2p5

Reasoning · Speedfireworks/accounts/fireworks/models/kimi-k2p5Moonshot Kimi K2.5 on Fireworks serverless: multimodal 1T-parameter MoE with strong agentic tool use.
  • $0.60 / $3.00
  • 256K context
  • Text, Image input

glm-5p1

Reasoning · Speedfireworks/accounts/fireworks/models/glm-5p1Z.ai GLM-5.1 on Fireworks serverless: post-training upgrade with stronger coding, reasoning, and agentic tool use.
  • $1.00 / $3.20
  • 200K context
  • Text input

minimax-m2p5

Reasoning · Speedfireworks/accounts/fireworks/models/minimax-m2p5MiniMax M2.5 MoE (230B total, 10B active) with SOTA coding and agentic tool use on Fireworks serverless.
  • $0.30 / $1.20
  • 200K context
  • Text input

gpt-oss-120b

Reasoning · Speedfireworks/accounts/fireworks/models/gpt-oss-120bOpenAI’s open-weight 120B MoE achieving near-parity with o4-mini on reasoning benchmarks. Apache 2.0.
  • $0.15 / $0.60
  • 128K context
  • Text input

gpt-oss-20b

Reasoning · Speedfireworks/accounts/fireworks/models/gpt-oss-20bOpenAI’s compact 20B MoE similar to o3-mini, running on edge devices with 16GB memory. Apache 2.0.
  • $0.07 / $0.30
  • 128K context
  • Text input