Source: Fireworks model docs. All model IDs use the prefix
fireworks/accounts/fireworks/models/. Only serverless models are listed here — other Fireworks models require a dedicated deployment.All Models
deepseek-v4-pro
Reasoning · Speed
fireworks/accounts/fireworks/models/deepseek-v4-proDeepSeek V4 Pro on Fireworks serverless: frontier open MoE for coding, reasoning, and up to ~1M token context with function calling.- $1.74 / $3.48
- 1M context
- Text input
qwen3p6-plus
Reasoning · Speed
fireworks/accounts/fireworks/models/qwen3p6-plusQwen 3.6 multimodal plus tier on Fireworks for vision-language and general agent tasks.- $0.50 / $3.00
- 262K context
- Text, Image input
kimi-k2p6
Reasoning · Speed
fireworks/accounts/fireworks/models/kimi-k2p6Moonshot Kimi K2.6 on Fireworks: multimodal MoE for high-quality tool use and long-context workloads.- $0.95 / $4.00
- 262K context
- Text, Image input
kimi-k2p5
Reasoning · Speed
fireworks/accounts/fireworks/models/kimi-k2p5Moonshot Kimi K2.5 on Fireworks serverless: multimodal 1T-parameter MoE with strong agentic tool use.- $0.60 / $3.00
- 256K context
- Text, Image input
glm-5p1
Reasoning · Speed
fireworks/accounts/fireworks/models/glm-5p1Z.ai GLM-5.1 on Fireworks serverless: post-training upgrade with stronger coding, reasoning, and agentic tool use.- $1.00 / $3.20
- 200K context
- Text input
minimax-m2p5
Reasoning · Speed
fireworks/accounts/fireworks/models/minimax-m2p5MiniMax M2.5 MoE (230B total, 10B active) with SOTA coding and agentic tool use on Fireworks serverless.- $0.30 / $1.20
- 200K context
- Text input
gpt-oss-120b
Reasoning · Speed
fireworks/accounts/fireworks/models/gpt-oss-120bOpenAI’s open-weight 120B MoE achieving near-parity with o4-mini on reasoning benchmarks. Apache 2.0.- $0.15 / $0.60
- 128K context
- Text input
gpt-oss-20b
Reasoning · Speed
fireworks/accounts/fireworks/models/gpt-oss-20bOpenAI’s compact 20B MoE similar to o3-mini, running on edge devices with 16GB memory. Apache 2.0.- $0.07 / $0.30
- 128K context
- Text input