모델 목록으로
DeepSeek오픈소스

DeepSeek V3.2

An open-source large-scale model developed by China's DeepSeek-AI. It adopts the Mixture-of-Experts (MoE) architecture, with an efficient design where only a portion of the 685B parameters are activated during inference. While available for commercial use under the MIT license, its API fees are also extremely low.

파라미터

685B (MoE)

컨텍스트

128K

라이선스

MIT

출시일

2026-03-28

일본어 처리 능력

High-Quality JP

Multilingual model with strong Japanese language processing capabilities.

API 가격

입력 가격 (1M 토큰당)

$0.27

출력 가격 (1M 토큰당)

$1.1

과금 모드: standard

강점

  • MIT license allows commercial use
  • Extremely low API costs
  • Efficient MoE architecture
  • Supports 128K context

약점

  • Japanese processing inferior to specialized models
  • Servers may be located in mainland China
  • Slightly slow inference speed

활용 사례

  • Large-scale processing focused on cost savings
  • On-premises model deployment
  • Research on MoE architecture
  • Embedding into commercial services

심층 분석

Arena Elo

1485

Thinking variant, #3 overall on BenchLM

SWE-Bench Verified

80.8%

vs GPT-5.2: 80.0% (paper claim)

Input Price

$0.28/1M tokens

~20x cheaper than GPT-5.2

Output Price

$1.10/1M tokens

~50x cheaper than GPT-5.2

Context Window

128K tokens

Standard frontier length

Total Parameters

685B

37B active via MoE architecture

License

MIT

Fully open-source, commercial use allowed

강점

  • Extraordinary cost-to-performance ratio with pricing ~20x lower than GPT-5.2
  • MIT license enables unrestricted commercial use and self-hosting
  • Gold-medal level performance in IMO and IOI competitions
  • DeepSeek Sparse Attention dramatically improves long-context efficiency
  • Exceptional agentic capabilities with integrated thinking and tool use
  • Strong coding performance competitive with frontier models

약점

  • Slower inference speed (43 t/s measured) compared to most API competitors
  • Creative writing and conversational tone lag behind proprietary models
  • 128K context window is smallest among major frontier models
  • Chinese jurisdiction raises data privacy concerns for some enterprises
  • Weaker safety guardrails compared to Anthropic or OpenAI offerings
  • Occasional inconsistencies in complex multi-step instruction following

경쟁사 비교

ModelArenaSWEGPQAPrice
GPT-5 High1550+80.0%85.7%$15/$60
Gemini 3.0 Pro1520+76.2%91.9%$10/$40
Kimi K2 Thinking1480+71.3%84.5%Est. $5/$20
Claude 4.5 Sonnet147577.2%83.4%$3/$15

DeepSeek V3.2 represents a paradigm shift in open-source AI, delivering near-frontier performance at a fraction of proprietary model costs. This 685B parameter Mixture-of-Experts model activates only 37B parameters during inference, achieving remarkable computational efficiency while maintaining competitive benchmark scores. The model's DeepSeek Sparse Attention mechanism reduces complexity from O(L²) to O(Lk) for long-context processing, addressing a critical efficiency bottleneck in transformer architectures.

Released under the MIT license in December 2025, V3.2 bridges the gap between open-source and closed-source models, particularly excelling in mathematical reasoning (gold medals in IMO and IOI), coding tasks, and agentic capabilities. Its 'thinking with tools' innovation integrates chain-of-thought reasoning with tool execution, enabling more sophisticated problem-solving approaches. At approximately $0.28 per million input tokens, it offers approximately 20x cost savings over GPT-5.2 while delivering comparable performance on key benchmarks.

The model's significance extends beyond raw performance metrics. It demonstrates that frontier-level AI capabilities need not require frontier-level pricing or proprietary restrictions. For organizations prioritizing cost efficiency, data sovereignty, and customization potential, DeepSeek V3.2 presents a compelling alternative to closed-source offerings, though with trade-offs in speed, creative capabilities, and certain safety features.

분석 생성일: 2026-05-23