Zhipu AI · Open Source

GLM-4.7-Flash

GLM-4.7-Flash is a reasoning model developed by Zhipu AI. It has approximately 310B parameters and supports a 200K-token context window.

Parameters

310.0B

Context Window

200K

License

MIT

Release Date

2026-01-19

API Pricing

API pricing for this model is not yet available

Strengths

  • Large 310B parameter count
  • Long 200K-token context window
  • Permissive MIT license

Weaknesses

  • Large 62.5GB file size
  • High hardware requirements
  • Potential for increased inference cost

Use Cases

  • Complex logical reasoning tasks
  • Analysis of long documents
  • Open-source AI development
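For the long-document use case, a quick pre-check can indicate whether a text is likely to fit in the 200K-token window. The sketch below is a rough heuristic only: the ratio of 4 characters per token, the reading of "200K" as exactly 200,000 tokens, the reserved output budget, and the function names are all assumptions for illustration, not properties of the GLM tokenizer.

```python
import math

# Assumption: "200K" on the model card is read as 200,000 tokens.
CONTEXT_WINDOW_TOKENS = 200_000
# Assumption: roughly 4 characters per token, a common rule of thumb for English text.
CHARS_PER_TOKEN = 4.0


def estimate_tokens(text: str, chars_per_token: float = CHARS_PER_TOKEN) -> int:
    """Rough token estimate from character count (a heuristic, not a tokenizer)."""
    return math.ceil(len(text) / chars_per_token)


def fits_in_context(text: str, reserved_output_tokens: int = 4_000) -> bool:
    """True if the estimated prompt plus a reserved output budget fits the window."""
    return estimate_tokens(text) + reserved_output_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    sample = "word " * 50_000  # 250,000 characters of placeholder text
    print(estimate_tokens(sample), fits_in_context(sample))
```

For anything precise, the count should come from the model's actual tokenizer, since the ratio varies by language and content.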

Deep Analysis

Parameters

~310B MoE

Context Window

200K tokens

AA Intelligence Index

22.1

Output Speed

87.8 tok/s

Pricing

$0.06 input / $0.40 output per 1M tokens

Release Date

January 19, 2026

Strengths

  • Extremely affordable ($0.06/$0.40)
  • 200K context at a budget price
  • Text+image input
  • Good speed (87.8 tok/s)
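To put the reported output speed in practical terms, the sketch below converts a response length into an approximate generation time. It assumes the 87.8 tok/s figure holds steadily and ignores time to first token and network latency; the function name is hypothetical.

```python
# Output speed reported in the analysis above, in tokens per second.
OUTPUT_TOKENS_PER_SECOND = 87.8


def estimated_generation_seconds(
    output_tokens: int, tokens_per_second: float = OUTPUT_TOKENS_PER_SECOND
) -> float:
    """Approximate time to generate a response at a constant output speed."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return output_tokens / tokens_per_second


if __name__ == "__main__":
    for n in (250, 1_000, 4_000):
        print(f"{n} tokens: about {estimated_generation_seconds(n):.1f} s")
```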

Weaknesses

  • Low intelligence score (AA Intelligence Index 22.1)
  • GPQA and HLE scores of 0.5 each
  • Not suited to complex reasoning
  • Superseded by GLM-5

Competitor Comparison

Model               Price per 1M tokens
GPT-4o-mini         $0.15/$0.60
DeepSeek V3-Chat    $0.27/$1.10

GLM-4.7-Flash is a budget variant priced at $0.06/$0.40 per 1M tokens. It offers a 200K context window and image input, but scores low on intelligence benchmarks.
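The price gap is easier to judge per request than per million tokens. The following sketch computes the cost of a single call for each model listed above. It assumes each price pair is input price followed by output price, both per 1M tokens; the `Pricing` class and `request_cost` helper are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """USD per 1M tokens; assumes the listed pair is input/output."""
    input_per_million: float
    output_per_million: float


# Prices as listed in the comparison above.
PRICES = {
    "GLM-4.7-Flash": Pricing(0.06, 0.40),
    "GPT-4o-mini": Pricing(0.15, 0.60),
    "DeepSeek V3-Chat": Pricing(0.27, 1.10),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one request with the given token counts."""
    p = PRICES[model]
    total = input_tokens * p.input_per_million + output_tokens * p.output_per_million
    return total / 1_000_000


if __name__ == "__main__":
    # Example workload: a long document in, a short summary out.
    for name in PRICES:
        print(f"{name}: ${request_cost(name, 100_000, 1_000):.4f}")
```

For a 100,000-token prompt with a 1,000-token reply, this works out to roughly $0.0064 for GLM-4.7-Flash, $0.0156 for GPT-4o-mini, and $0.0281 for DeepSeek V3-Chat.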

Analysis generated: 2026-05-24