Source: Google AI model docs
Gemini 3
gemini-3.1-pro-preview
Reasoning · Speed
google/gemini-3.1-pro-previewThe upgraded core intelligence model representing a step forward in reasoning, a smarter and more capable baseline for complex problem-solving across consumer and developer products.- $2 / $12
- 1M context
- 64K max output
- Text, Image, Video, Audio, PDF input
- Thinking
- Web search
- Knowledge cutoff Jan 2025
gemini-3.1-flash-lite-preview
Reasoning · Speed
google/gemini-3.1-flash-lite-previewThe fastest and most cost-efficient Gemini 3 model, built for high-volume developer workloads at scale with 2.5x faster time to first token versus 2.5 Flash.- $0.25 / $1.50
- 1M context
- 64K max output
- Text, Image, Video, Audio input
- Thinking
- Web search
- Knowledge cutoff Jan 2025
gemini-3-flash-preview
Reasoning · Speed
google/gemini-3-flash-previewCombines Gemini 3 Pro’s reasoning capabilities with the Flash line’s latency, efficiency, and cost, designed for the most complex agentic workflows.- $0.50 / $3
- 1M context
- 64K max output
- Text, Image, Video, Audio input
- Thinking
- Web search
- Knowledge cutoff Jan 2025
Gemini 2.5
$2.50 / $15 above 200K input tokens. All Gemini models support Google Search grounding.
gemini-2.5-pro
Reasoning · Speed
google/gemini-2.5-proThe most advanced reasoning Gemini model, capable of solving complex problems across text, audio, images, video, and code repositories.- $1.25 / $10
- 1M context
- 65K max output
- Text, Image, Video, Audio, PDF input
- Extended thinking
- Web search
- Knowledge cutoff Jan 2025
gemini-2.5-pro-preview-tts
Reasoning · Speed
google/gemini-2.5-pro-preview-ttsText-to-speech variant of Gemini 2.5 Pro for generating audio output from text input.- $1.25 / $10
- 1M context
- 66K max output
- Text input
- Audio output
- Knowledge cutoff Jan 2025
gemini-2.5-flash
Reasoning · Speed
google/gemini-2.5-flashThe best model in terms of price and performance with well-rounded capabilities, the first Flash model featuring thinking capabilities.- $0.30 / $2.50
- 1M context
- 65K max output
- Text, Image, Video, Audio input
- Thinking
- Web search
- Knowledge cutoff Jan 2025
gemini-2.5-flash-lite
Reasoning · Speed
google/gemini-2.5-flash-liteThe fastest and most budget-friendly multimodal reasoning model in the 2.5 family, ideal for classification, translation, and high-scale operations.- $0.10 / $0.40
- 1M context
- 65K max output
- Text, Image, Video, Audio input
- Thinking
- Web search
- Knowledge cutoff Jan 2025
gemini-2.5-flash-native-audio
Reasoning · Speed
google/gemini-2.5-flash-native-audio-preview-12-2025Native audio dialog model for real-time voice conversations with thinking support.- $0.30 / $2.50
- 1M context
- 8K max output
- Text, Audio input
- Audio output
- Thinking
- Knowledge cutoff Jan 2025
gemini-2.5-flash-image
Reasoning · Speed
google/gemini-2.5-flash-imageSpecialized model for image generation and editing from text and image prompts. ~$0.039 per image output.- $0.30 / $30
- 32K context
- 32K max output
- Text, Image input
- Text, Image output
- Knowledge cutoff Jan 2025
gemini-2.5-flash-preview-tts
Reasoning · Speed
google/gemini-2.5-flash-preview-ttsText-to-speech variant of Gemini 2.5 Flash for generating audio output at lower cost.- $0.15 / $0.60
- 1M context
- 66K max output
- Text input
- Audio output
- Knowledge cutoff Jan 2025