Back to Models
Google Deep MindProprietary

Gemini 2.5 Deep Think

Gemini 2.5 Deep Think is an inference model developed by Google DeepMind. It is a chat-specialized foundational model equipped with a very long context window of 1000K.

Parameters

Undisclosed

Context Window

1000K

License

Proprietary

Release Date

2025-08-01

API Pricing

API pricing for this model is not yet available

Strengths

  • Extremely long context window.
  • Achieves advanced reasoning capabilities.
  • Developed by Google DeepMind.

Weaknesses

  • Closed-source license.
  • Internal model details are undisclosed.
  • Potential usage restrictions.

Use Cases

  • Analysis of large documents.
  • Complex logical reasoning tasks.
  • Advanced conversational chat use.

Deep Analysis

GPQA Diamond

92%

PhD-level science reasoning, tops Claude 4.7 (88%) and GPT-5 (90%)

SWE-bench Verified

~78%

Coding benchmark, behind Claude 4.7 (~85%) and GPT-5 (~80%)

Arena Elo

~1465

Based on Gemini 2.5 Pro base, Deep Think adds extended reasoning

Context Window

1M tokens

1,000,000 input tokens

Input Price

$1.25/M tokens

Cheapest frontier reasoning model at 1M-token scale

Output Price

$10.00/M tokens

Standard output pricing for reasoning tier

Output Speed

~30 tok/s

Slower than GPT-5 (~110) and Claude 4.7 (~80) due to deep reasoning chains

Release

April 2026 (GA)

Opened to all paid API users, previously AI Ultra only

Strengths

  • Highest GPQA Diamond score (92%) among frontier models as of May 2026
  • Cheapest frontier reasoning model at $1.25/M input
  • 1M token context with 98% needle-in-haystack at 500K
  • Gold-medal level on International Math Olympiad
  • Strong needle-in-haystack retrieval (98% at 500K tokens)

Weaknesses

  • Slower output speed (~30 tok/s) vs competitors due to extended thinking
  • Time-to-first-token ~1.1s, slower than GPT-5 (0.4s) and Claude 4.7 (0.6s)
  • Weaker coding performance (78% SWE-bench) vs Claude 4.7 (85%)
  • Computer use / browser capability still experimental
  • Deep Think mode requires more compute, increasing effective cost

Competitor Comparison

ModelArenaSWEGPQAPrice
Claude 4.7 Sonnet~1470~85%88%$3/$15 per 1M
GPT-5~1480~80%90%$5/$20 per 1M
Gemini 2.5 Deep Think~1465~78%92%$1.25/$10 per 1M

Analysis generated: 2026-05-24