모델 목록으로
Google Deep Mind독점

Gemini 3.2 Flash (Unreleased Preview)

Gemini 3.2 Flash (Unreleased Preview) is a reasoning model developed by Google DeepMind. Equipped with a very long context window of 2M tokens, it is capable of processing large-scale data.

파라미터

Undisclosed

컨텍스트

2M

라이선스

Proprietary

출시일

2026-06-15

API 가격

이 모델의 API 가격 정보는 현재 공개되지 않았습니다

강점

  • Advanced reasoning capabilities
  • Extensive 2-million token context understanding
  • Latest design from Google DeepMind

약점

  • Closed-source model
  • Unreleased preview version
  • Limited detailed performance evaluation

활용 사례

  • Analysis of large documents
  • Complex logical reasoning tasks
  • Processing long-context information

심층 분석

LM Arena Ranking

Rising rapidly

Performing 'two tiers above expected weight class' in blind Arena battles

SWE-Bench Verified

~78%

Gemini 3 Flash tier; exceeds Gemini 3.1 Pro (76.2%) on agentic coding

LiveCodeBench

90.8%

Second only to Gemini 3 Pro Preview (91.7%)

Input Price (leaked)

$0.25/1M tokens

50% cheaper than Gemini 3 Flash ($0.50); matches Flash-Lite

Output Price (leaked)

$2.00/1M tokens

33% cheaper than Gemini 3 Flash ($3.00)

Context Window

1M tokens

In + 64K out; consistent with Gemini 3 series (user-specified 2M unconfirmed by leaks)

Inference Speed

2.7x faster than Pro

Sub-200ms first-token latency on many prompts; some tasks completed in <2 min vs ~5 min for Pro

강점

  • Pro-level coding and 3D spatial reasoning at Flash-tier pricing — a genuine 'tier collapse' between efficiency and flagship models
  • Aggressive leaked pricing ($0.25/$2.00) slashes input costs 50% and output costs 33% versus predecessor Gemini 3 Flash
  • Native multimodal support (text, image, audio, video) with 1M-token context window — unmatched at this price point among competitors like DeepSeek V4-Flash

약점

  • Not officially released — all specs, benchmarks, and pricing are based on leaks and anonymous Arena testing, subject to change at I/O
  • Weaker than GPT-5.5 on the hardest multi-file engineering tasks (SWE-Bench Pro: 58.6% for GPT-5.5 vs lower for Gemini) and complex chain-of-thought reasoning
  • Slightly higher latency than Gemini 3.1 Flash for short, snappy conversational queries; not ideal for real-time voice applications

경쟁사 비교

ModelArenaSWEGPQAPrice
DeepSeek V4-FlashN/A (open-weight)N/AN/A$0.14/$0.14 (est.)
Claude Haiku 4.7N/AN/AN/APremium efficiency tier
GPT-5.3 InstantN/AN/AN/ATiered API (higher than Gemini Flash)
Gemini 3.1 Pro (Google)N/A76.2%94.3%$1.25–$2.50 / $10.00–$15.00

Gemini 3.2 Flash (Unreleased Preview) is Google DeepMind's next-generation efficiency-tier model that appeared unexpectedly on May 5, 2026, via leaks in the iOS Gemini app, Google AI Studio metadata, and anonymous LM Arena evaluations ahead of Google I/O 2026. If the leaked specifications hold, it represents a fundamental shift in the cost-to-performance calculus for developers: Pro-level coding and 3D spatial reasoning at half the input cost of its predecessor, Gemini 3 Flash. Early testers reported the model generating 2,200-line interactive codebases, functional SVG environments, and playable Three.js scenes from single prompts — tasks that Gemini 3.1 Pro itself struggled to complete cleanly. The model's ability to 'punch above its weight class' appears to stem from advanced on-policy distillation and sparse Mixture-of-Experts routing that compresses Pro-tier reasoning pathways into the faster, cheaper Flash architecture.

The competitive implications are significant. At $0.25 per million input tokens and $2.00 per million output tokens, Gemini 3.2 Flash enters a market dominated by DeepSeek V4-Flash ($0.14/M input, text-only) and Anthropic's Claude Haiku 4.7 (premium pricing, 200K context). Google's key differentiator is native multimodal support across text, image, audio, and video paired with a massive 1-million-token context window — capabilities no competitor at this price point currently offers. However, the model is not a frontier-killer: GPT-5.5 still leads on the hardest multi-file engineering tasks (SWE-Bench Pro: 58.6% vs. lower for Gemini), and complex chain-of-thought scientific reasoning remains Gemini 3.1 Pro's domain.

The broader strategic signal is Google's shift to a software-style rapid release cadence — 3.0 Flash in December 2025, 3.1 Flash-Lite in March 2026, 3.2 Flash now in May — with sub-quarterly update cycles. Combined with the leaked 'Liquid Glass' UI redesign and a placeholder 'Agents (Beta)' tab in the Gemini app, this release appears to be the opening salvo of Google's agentic AI strategy for 2026. The Vertex AI deprecation notices for legacy Gemini 2 Flash (deadline: June 1, 2026) add urgency for enterprise customers to migrate. As of May 18, 2026, none of this is officially confirmed; the I/O keynote on May 19 is expected to settle all questions.

분석 생성일: 2026-05-23