Qwen3.5-27B
Qwen3.5-27B is a reasoning model developed by Alibaba. It has a parameter scale of about 270B and supports an extremely long context window of 1010K.
Parameters
270.0B
Context Window
1010K
License
https://huggingface.co/Qwen/Qwen2.5-72B/blob/main/LICENSE
Release Date
2026-02-25
API Pricing
API pricing for this model is not yet available
Strengths
- ・Provides advanced reasoning capabilities
- ・Handles massive context processing
- ・Large-scale parameter configuration
Weaknesses
- ・Closed-source license
- ・Large model file size
- ・High runtime resource requirements
Use Cases
- ・Complex logical reasoning tasks
- ・Ultra-long document analysis
- ・Advanced knowledge extraction
Deep Analysis
Release Date
February 24, 2026
Parameters
27B
Dense model — all parameters active
Architecture
Hybrid: Gated DeltaNet + Gated Attention (dense)
Context Window
262,144 tokens (native)
Modalities
Text, Image, Video
VRAM (Q4)
~16 GB
VRAM (BF16)
~54 GB
Inference Speed
~35 tok/s on RTX 3090 at Q4
License
Apache 2.0
MMLU-Pro
86.1
Strengths
- ・Best creative writing quality in the Qwen3.5 family — denser computation produces more consistent prose
- ・Strong reasoning: GPQA Diamond 85.5, MMLU-Pro 86.1, IFEval 95.0
- ・Fits on a single 24GB GPU at Q4 quantization (~16GB VRAM)
- ・Dense architecture means simpler deployment — no MoE routing complexity
- ・Natively multimodal with vision and video support
Weaknesses
- ・Slower inference (~35 tok/s) compared to the 35B-A3B MoE model (196 tok/s)
- ・Lacks the raw speed for batch processing and real-time agent workflows
- ・Trails the 35B-A3B on throughput-sensitive tasks despite having more active parameters
- ・Not available as an API model through major providers (primarily self-hosted)
- ・Quantization at Q4 may impact quality for nuanced creative tasks
Competitor Comparison
| Model | Arena | SWE | GPQA | Price |
|---|---|---|---|---|
| Qwen3.5-35B-A3B | ~1390 | ~65 | ~83 | Open-source |
| Qwen3.5-9B | ~1370 | ~60 | 81.7 | Open-source |
| Llama 4 Scout | ~1380 | ~65 | ~80 | Open-source |
| Qwen3.5-27B | ~1400 | ~68 | 85.5 | Open-source |
| Mistral Large 2 | ~1370 | ~64 | ~78 | Open-source |
Qwen3.5-27B is the only dense model in the mid-range of the Qwen3.5 family, offering 27B parameters with all of them active on every token. Released February 24, 2026 under Apache 2.0, it excels at creative writing and complex reasoning where every parameter contributes to output quality. It runs at ~35 tok/s on a single RTX 3090 at Q4, fitting comfortably in 16GB VRAM.
Sources
Analysis generated: 2026-05-24