Skip to main content
Every model supported by Timbal with full specs, pricing, capability scores, and short descriptions. Price = per 1M tokens (input / output). All models support tool/function calling unless noted.
https://content.timbal.ai/assets/openai_favicon.svg

OpenAI

GPT-5, GPT-4, and o-series reasoning models
https://content.timbal.ai/assets/claude_favicon.svg

Anthropic

Claude 4 Opus, Sonnet, and Haiku
https://content.timbal.ai/assets/gemini_favicon.svg

Google

Gemini 3 and 2.5 series
https://content.timbal.ai/assets/grok_favicon.svg

xAI

Grok 4 and Grok 4 Fast
https://content.timbal.ai/assets/together_favicon.svg

TogetherAI

Open-source models via TogetherAI
https://content.timbal.ai/assets/groq_favicon.svg

Groq

Ultra-low latency via Groq LPU
https://content.timbal.ai/assets/fireworks_favicon.svg

Fireworks

Open-source models via Fireworks
https://content.timbal.ai/assets/xiaomi_favicon_rounded.svg

Xiaomi MiMo

MiMo V2 Pro, Omni, and Flash
https://content.timbal.ai/assets/byteplus_favicon.svg

BytePlus

Seed 2.0 and Seed 1.8 models
https://content.timbal.ai/assets/cerebras_favicon.svg

Cerebras

Wafer-scale inference at world-record token speeds
https://content.timbal.ai/assets/sambanova_favicon.svg

SambaNova

High-throughput inference on custom RDU hardware

Scoring

Each model is rated on two axes using a 1-5 scale:
  • Reasoning — depth of analytical and chain-of-thought capability
  • Speed — relative latency and throughput for its class
Scores are relative within the full set of models on this page, not within a single provider. A reasoning score of 5 means frontier-class reasoning (e.g. o3-pro, Claude Opus 4.7, GPT-5.5, Gemini 2.5 Pro). A speed score of 5 means the fastest tier (nano/mini/flash-lite models, or Groq-hosted inference).