What are the strengths of this model?

Advanced reasoning capabilities. Maintains 128K long context. Available under an open license.

What are the weaknesses of this model?

Enormous model size. Requires high computational resources. Increased operational costs.

What are the best use cases?

Complex logical reasoning tasks. Analysis of long documents. Advanced knowledge extraction.

Back to Models

Zhipu AIOpen Source

GLM-4.5-MoE-106B-A12B-0715

Name: GLM-4.5-MoE-106B-A12B-0715
Author: Zhipu AI

GLM-4.5-MoE-106B-A12B-0715 is an inference model developed by Zhipu AI. It is provided as a foundational model with a 128K context window and a large parameter scale.

Parameters

1060.0B

Context Window

128K

License

Apache 2.0

Release Date

2025-07-28

API Pricing

API pricing for this model is not yet available

Strengths

・Advanced reasoning capabilities.
・Maintains 128K long context.
・Available under an open license.

Weaknesses

・Enormous model size.
・Requires high computational resources.
・Increased operational costs.

Use Cases

・Complex logical reasoning tasks.
・Analysis of long documents.
・Advanced knowledge extraction.

Deep Analysis

Architecture

MoE (106B total, 12B active)

Lightweight GLM-4.5 variant

Context Window

128K tokens

Release Date

July 2025

License

Open weights

Apache 2.0

Specialization

Agent, Reasoning, Coding (ARC)

Purpose-built for agentic apps

Strengths

・Efficient 12B active parameters from 106B total
・128K context window
・Optimized for tool invocation and agents
・Supports hybrid reasoning modes (Thinking/Non-Thinking)
・Free Flash tier available
・Strong coding and agent capabilities

Weaknesses

・Less capable than full GLM-4.5 355B
・Chinese model with some English limitations
・Competing in crowded open-weight space

Competitor Comparison

Model	Arena	SWE	GPQA	Price
GLM-4.5 355B	-	-	-	Higher
DeepSeek V3	-	-	-	Comparable
Qwen 2.5 72B	-	-	-	Comparable

Overview

GLM-4.5-Air (106B-A12B) is Zhipu AI's cost-effective model in the GLM-4.5 series. With 106B total and 12B active parameters, it delivers strong reasoning, coding, and agent capabilities at lower cost than the full 355B variant.

Benchmarks & Performance

Competitive on AIME, GPQA, and coding benchmarks. Strong agent and tool-use performance. Supports Claude Code and Roo Code integration.

Detailed Comparison

Lighter alternative to GLM-4.5 355B. Comparable to DeepSeek V3 on many tasks at lower cost.

Community Feedback

Available on Z.AI platform. Free Flash tier available. Strong developer adoption in China.

Use Cases

Agentic applications, coding assistants, tool-calling workflows, and cost-sensitive production deployments.

Latest News

Part of Zhipu's GLM-4.5 family released mid-2025. Available alongside free Flash tier.

Sources

Analysis generated: 2026-05-24