모델 목록으로
Alibaba독점

Qwen3-Max-Thinking

Qwen3-Max-Thinking is an inference model developed by Alibaba. It has a large-scale configuration of approximately 10 trillion parameters and a very long context window of 1 million tokens.

파라미터

10000.0B

컨텍스트

1000K

라이선스

Proprietary

출시일

2026-01-26

API 가격

이 모델의 API 가격 정보는 현재 공개되지 않았습니다

강점

  • Overwhelming parameter scale
  • 1 million token long-context comprehension
  • Pursuing advanced reasoning capabilities

약점

  • Non-public closed model
  • Limited license
  • Detailed performance metrics not disclosed

활용 사례

  • Executing complex logical reasoning
  • Analyzing ultra-large-scale data
  • Automating advanced problem-solving

심층 분석

Release Date

January 23, 2026

Parameters

Proprietary (undisclosed)

Context Window

262,144 tokens

Architecture

Decoder-only with extended thinking

Input Price

$0.78/1M tokens

Output Price

$3.90/1M tokens

GPQA Diamond

87.4

SWE-bench Verified

75.3

HLE (w/ tools)

49.8

API Model Name

qwen3-max-2026-01-23

강점

  • Competitive with GPT-5.2-Thinking and Claude-Opus-4.5 on 19 established benchmarks
  • Adaptive tool-use: autonomously invokes Search, Memory, and Code Interpreter
  • Excellent value: $0.78/$3.90 pricing is much cheaper than GPT-5.2 and Claude Opus 4.5
  • 100% reliability rate across evaluated benchmarks — never fails to produce output
  • Strong on C-Eval (93.7), HLE with tools (49.8), and coding tasks (97th percentile)

약점

  • General knowledge is a notable weakness (23rd percentile on broad factual recall)
  • Trails GPT-5.2-Thinking on MMLU-Pro (85.7 vs 87.4) and GPQA (87.4 vs 92.4)
  • Now partially superseded by Qwen3.7-Max as the flagship reasoning model
  • Test-time scaling adds latency and token cost for heavy mode
  • SWE-bench 75.3% trails Claude Opus 4.5 (80.9%) and GPT-5.2 (80.0%)

경쟁사 비교

ModelArenaSWEGPQAPrice
GPT-5.2-Thinking~150080.092.4Proprietary
Claude-Opus-4.5~149080.987.0Proprietary
Gemini 3 Pro~148076.291.9Proprietary
Qwen3-Max-Thinking~145075.387.4$0.78/$3.90
DeepSeek V3.2~143073.182.4Proprietary

Qwen3-Max-Thinking is Alibaba's flagship reasoning model from the Qwen3 generation, released January 23, 2026. It achieves competitive performance with GPT-5.2-Thinking and Claude-Opus-4.5 across 19 benchmarks while offering significantly lower pricing ($0.78/$3.90 per 1M tokens). Key innovations include adaptive tool-use capabilities and an experience-cumulative test-time scaling strategy that boosts reasoning through iterative self-reflection.

분석 생성일: 2026-05-24