What are the strengths of this model?

Large 310B parameter count 200K broad context length Openness via MIT license

What are the weaknesses of this model?

Large 62.5GB file size High hardware requirements Potential for increased inference cost

What are the best use cases?

Complex logical reasoning tasks Analysis of long documents Open-source AI development

Back to Models

Zhipu AIOpen Source

GLM-4.7-Flash

Name: GLM-4.7-Flash
Author: Zhipu AI

GLM-4.7-Flash is an inference model developed by Zhipu AI. It has a parameter scale of approximately 310.0B and supports a long context window of 200K.

Parameters

310.0B

Context Window

200K

License

MIT

Release Date

2026-01-19

API Pricing

API pricing for this model is not yet available

Strengths

・Large 310B parameter count
・200K broad context length
・Openness via MIT license

Weaknesses

・Large 62.5GB file size
・High hardware requirements
・Potential for increased inference cost

Use Cases

・Complex logical reasoning tasks
・Analysis of long documents
・Open-source AI development

Deep Analysis

Parameters

~310B MoE

Context Window

200K tokens

AA Intelligence Index

22.1

Output Speed

87.8 tok/s

Pricing

$0.06/$0.40 per 1M tokens

Release Date

January 19, 2026

Strengths

・Extremely affordable ($0.06/$0.40)
・200K context at budget price
・Text+image input
・Good speed (87.8 tok/s)

Weaknesses

・Low intelligence (AA Index 22.1)
・GPQA/HLE 0.5 each
・Not for complex reasoning
・Superseded by GLM-5

Competitor Comparison

Model	Price
GPT-4o-mini	$0.15/$0.60
DeepSeek V3-Chat	$0.27/$1.10

Overview

GLM-4.7-Flash is a budget variant at $0.06/$0.40 per 1M tokens. 200K context, image input, but low intelligence.

Benchmarks & Performance

AA Index 22.1, GPQA/HLE 0.5. Designed for high-volume low-complexity tasks.

Detailed Comparison

Cheapest option for basic tasks. Not comparable to full GLM-4.7 or GLM-5.

Community Feedback

Limited attention as budget-tier model.

Use Cases

Text summarization, simple Q&A, content classification at scale.

Latest News

Released January 2026. GLM-5 series now preferred.

GLM-4.7-Flash is a budget variant at $0.06/$0.40 per 1M tokens. 200K context, image input, but low intelligence.

Sources

CloudPrice

Analysis generated: 2026-05-24