Anthropic
Claude 4 Opus, Sonnet, and Haiku
BytePlus
Seed 2.0 and Seed 1.8 models
Cerebras
Wafer-scale inference at world-record token speeds
Fireworks
Open-source models via Fireworks
Gemini 3 and 2.5 series
Groq
Ultra-low latency via Groq LPU
OpenAI
GPT-5, GPT-4, and o-series reasoning models
SambaNova
High-throughput inference on custom RDU hardware
TogetherAI
Open-source models via TogetherAI
xAI
Grok 4 and Grok 4 Fast
Xiaomi MiMo
MiMo V2 Pro, Omni, and Flash
Scoring
Each model is rated on two axes using a 1-5 scale:- Reasoning — depth of analytical and chain-of-thought capability
- Speed — relative latency and throughput for its class