Back to Leaderboard

OpenClaw Ranking

OpenClaw agent performance: Claw Bench and Pinch Bench.

698 models

#ModelDeveloperOpen Source
1GLM-5-TurboZhipu AI93.886.5Closed
2Doubao Seed 2.0 Lite字节跳动Seed团队93.1Closed
3GPT-5.4OpenAI92.790.5Closed
4MiniMax M2.5MiniMax92.187.8Closed
5GLM-5Zhipu AI91.786.4Closed
6MiniMax-M2.7MiniMax91.787.1Closed
7Opus 4.5Anthropic91.587.2Closed
8Qwen3.5-35B-A3Bアリババ91.478.4Closed
9GLM-5V-TurboZhipu AI90.1Closed
10GPT-5.4 nanoOpenAI89.7Closed
11Haiku 4.5Anthropic89.482.0Closed
12Grok 4.1 FastxAI88.682.4Closed
13Claude Sonnet 4.5Anthropic88.188.2Closed
14Qwen3.5-122B-A10Bアリババ86.085.5Closed
15Gemini 3.0 FlashGoogle DeepMind85.785.2Closed
16Step 3.5 FlashStepFun84.985.3Closed
17Kimi K2 ThinkingMoonshot AI82.5Closed
18Kimi K2.5Moonshot AI81.784.8Closed
19Kimi K2.6Moonshot AI80.9Closed
20Gemini 2.5 Pro Experimental 03-25Google DeepMind80.471.9Closed
21DeepSeek V3.2DeepSeek79.084.3Closed
22Mistral Large 3Mistral78.672.2Closed
23Claude Sonnet 4Anthropic77.880.5Closed
24Qwen3-Coder-Nextアリババ75.879.1Closed
25GPT-5.4 miniOpenAI75.3Closed
26Qwen3.5-27Bアリババ75.290.0Closed
27Qwen3.6-27Bアリババ72.4Closed
28Nova 2 Liteアマゾン68.5Closed
29ERNIE 5.0 Thinking Previewバイドゥ51.0Closed
30Claude Mythos PreviewAnthropicClosed

About Benchmarks

Claw Bench
OpenClawエージェントベンチマーク — OpenClawプラットフォームでのエージェント性能を測定
Pinch Bench
OpenClawピンチベンチマーク — OpenClawプラットフォームでのタスク遂行能力を測定