Gemini 2.5 Deep Think
Gemini 2.5 Deep Think is a reasoning model developed by Google DeepMind. It is a chat-oriented foundation model with a very long context window of 1M (1000K) tokens.
Parameters
Undisclosed
Context Window
1M tokens (1000K)
License
Proprietary
Release Date
2025-08-01
API Pricing
API pricing for this model is not yet available
Strengths
- Extremely long context window.
- Advanced reasoning capabilities.
- Developed by Google DeepMind.
Weaknesses
- Closed-source license.
- Internal model details are undisclosed.
- Potential usage restrictions.
Use Cases
- Analysis of large documents.
- Complex logical reasoning tasks.
- Advanced conversational chat.
Deep Analysis
GPQA Diamond
92%
PhD-level science reasoning, tops Claude 4.7 (88%) and GPT-5 (90%)
SWE-bench Verified
~78%
Coding benchmark, behind Claude 4.7 (~85%) and GPT-5 (~80%)
Arena Elo
~1465
Built on the Gemini 2.5 Pro base model; Deep Think adds extended reasoning
Context Window
1M tokens
1,000,000 input tokens
Input Price
$1.25/M tokens
Cheapest frontier reasoning model at 1M-token scale
Output Price
$10.00/M tokens
Standard output pricing for reasoning tier
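The two prices above can be combined into a rough per-request estimate. The following is a minimal sketch, not an official billing formula: the function name `estimate_cost` and the example token counts are illustrative, and whether reasoning ("thinking") tokens are billed as output is not stated here, so `output_tokens` should be read as whatever count is actually billed.

```python
# Listed prices in USD per 1M tokens (taken from the figures above).
INPUT_PRICE_PER_M = 1.25
OUTPUT_PRICE_PER_M = 10.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated request cost in USD at the listed prices."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical workload: a 500K-token document and a 4K-token answer.
cost = estimate_cost(500_000, 4_000)
print(f"Estimated cost: ${cost:.4f}")
```

At these prices the input side dominates for long-document workloads, while long reasoning-heavy outputs shift the cost toward the output rate.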
Output Speed
~30 tok/s
Slower than GPT-5 (~110) and Claude 4.7 (~80) due to deep reasoning chains
Release
April 2026 (GA)
Opened to all paid API users, previously AI Ultra only
Strengths
- Highest GPQA Diamond score (92%) among frontier models as of May 2026
- Cheapest frontier reasoning model at $1.25/M input tokens
- 1M-token context window
- Gold-medal level on the International Math Olympiad
- Strong needle-in-haystack retrieval (98% at 500K tokens)
Weaknesses
- Slower output speed (~30 tok/s) than competitors due to extended thinking
- Time-to-first-token of ~1.1s, slower than GPT-5 (0.4s) and Claude 4.7 (0.6s)
- Weaker coding performance (~78% on SWE-bench Verified) than Claude 4.7 (~85%)
- Computer use / browser capability still experimental
- Deep Think mode requires more compute, increasing effective cost
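To see what the speed and time-to-first-token figures mean in practice, they can be combined into a rough response-time estimate. This is a simplified sketch that assumes constant throughput after the first token and ignores network and queueing delays; the names `LatencyProfile` and `estimate_response_seconds` are illustrative, and the numbers are the approximate values quoted in this analysis.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LatencyProfile:
    ttft_s: float        # time to first token, in seconds
    tokens_per_s: float  # approximate steady-state output speed


# Approximate figures quoted in this analysis.
PROFILES = {
    "Gemini 2.5 Deep Think": LatencyProfile(ttft_s=1.1, tokens_per_s=30.0),
    "GPT-5": LatencyProfile(ttft_s=0.4, tokens_per_s=110.0),
    "Claude 4.7": LatencyProfile(ttft_s=0.6, tokens_per_s=80.0),
}


def estimate_response_seconds(profile: LatencyProfile, output_tokens: int) -> float:
    """Estimate total response time as TTFT plus generation time."""
    return profile.ttft_s + output_tokens / profile.tokens_per_s


for name, profile in PROFILES.items():
    seconds = estimate_response_seconds(profile, 1_000)
    print(f"{name}: ~{seconds:.1f} s for 1,000 output tokens")
```

Under this model the gap is driven mostly by output speed rather than time-to-first-token, so it widens as responses get longer.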
Competitor Comparison
| Model | Arena Elo | SWE-bench Verified | GPQA Diamond | Price per 1M tokens (input / output) |
|---|---|---|---|---|
| Claude 4.7 Sonnet | ~1470 | ~85% | 88% | $3 / $15 |
| GPT-5 | ~1480 | ~80% | 90% | $5 / $20 |
| Gemini 2.5 Deep Think | ~1465 | ~78% | 92% | $1.25 / $10 |
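The comparison can also be read programmatically. The sketch below copies the table into a dictionary and picks the leader for each column; the field names are illustrative, and because most of the figures are approximate ("~"), small gaps between models should not be over-interpreted.

```python
# Values copied from the comparison table (scores are approximate).
MODELS = {
    "Claude 4.7 Sonnet": {"arena": 1470, "swe": 85, "gpqa": 88, "input_usd": 3.00, "output_usd": 15.00},
    "GPT-5": {"arena": 1480, "swe": 80, "gpqa": 90, "input_usd": 5.00, "output_usd": 20.00},
    "Gemini 2.5 Deep Think": {"arena": 1465, "swe": 78, "gpqa": 92, "input_usd": 1.25, "output_usd": 10.00},
}


def leader(metric: str, lower_is_better: bool = False) -> str:
    """Return the model with the best value for the given metric."""
    pick = min if lower_is_better else max
    return pick(MODELS, key=lambda name: MODELS[name][metric])


leaders = {
    "arena": leader("arena"),
    "swe": leader("swe"),
    "gpqa": leader("gpqa"),
    "input_usd": leader("input_usd", lower_is_better=True),
}

for metric, name in leaders.items():
    print(f"{metric}: {name}")
```

No single model leads every column, which matches the trade-off described above: Gemini 2.5 Deep Think leads on GPQA Diamond and price, while the other two lead on Arena Elo and SWE-bench Verified.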
Analysis generated: 2026-05-24