Qwen3.5-122B-A10B
Qwen3.5-122B-A10B is a reasoning model developed by Alibaba. It features a large-scale parameter count of approximately 1220.0B and a long context window of 262K.
Parameters
1220.0B
Context Window
262K
License
https://huggingface.co/Qwen/Qwen2.5-72B/blob/main/LICENSE
Release Date
2026-02-25
API Pricing
API pricing for this model is not yet available
Strengths
- ・Over 1.2 trillion parameters for vast capacity
- ・262K extremely long context understanding
- ・Achieves advanced reasoning capabilities
Weaknesses
- ・Closed-source license model
- ・High computational cost from massive model size
- ・License with usage restrictions
Use Cases
- ・Complex logical reasoning tasks
- ・Analysis of ultra-long documents
- ・Advanced knowledge-intensive tasks
Deep Analysis
Release Date
February 2026
Total Parameters
122B
MoE architecture
Active Parameters
10B per token
256 experts with selective activation
Context Window
262,144 tokens
Architecture
Hybrid MoE: Gated DeltaNet + Gated Attention
Modalities
Text, Image, Video
VRAM (Q4)
~70 GB
VRAM (BF16)
~244 GB
License
Apache 2.0
API Price
Available via DashScope, SiliconFlow, DeepInfra
Strengths
- ・Strong quality-to-compute ratio: 10B active parameters deliver near-frontier performance
- ・Natively multimodal with text, image, and video support from the same weights
- ・Fits on multi-GPU consumer setups at Q4 (~70GB VRAM) — accessible for serious enthusiasts
- ・262K native context with hybrid DeltaNet architecture for fast long-context inference
- ・Apache 2.0 license enables commercial use and fine-tuning
Weaknesses
- ・Positioned awkwardly between the 397B flagship and the 35B-A3B speed model
- ・70GB VRAM at Q4 still requires multi-GPU setup (2x RTX 4090 or better)
- ・No dedicated benchmark spotlight — overshadowed by the 397B and 9B in marketing
- ・Active parameters (10B) may be insufficient for the most demanding reasoning tasks
- ・Community adoption has been slower compared to the 9B, 27B, and 35B-A3B
Competitor Comparison
| Model | Arena | SWE | GPQA | Price |
|---|---|---|---|---|
| Qwen3.5-397B-A17B | ~1450 | 76.4 | 88.4 | $0.40/$2.40 |
| Qwen3.5-27B | ~1400 | ~68 | 85.5 | Open-source |
| Llama 4 Scout | ~1380 | ~65 | ~80 | Open-source |
| Qwen3.5-122B-A10B | ~1420 | ~72 | ~86 | Open-source |
| DeepSeek V3 | ~1410 | ~70 | ~82 | $0.14/$0.28 |
Qwen3.5-122B-A10B is the mid-tier MoE model in the Qwen3.5 family, with 122B total parameters and 10B active per token. It provides a strong balance between quality and inference efficiency, fitting on multi-GPU setups at ~70GB VRAM in Q4 quantization. Released under Apache 2.0 in February 2026, it is natively multimodal and benefits from the same hybrid DeltaNet architecture as its larger sibling.
Sources
Analysis generated: 2026-05-24