Model Comparison

Compare popular AI models by performance, price, and features

VS
SWE-Bench Pro
GPT-5.5

OpenAI

69.2 | SWE-Bench Pro | 58.6
93.6 | GPQA Diamond | 93.6
- | ARC-AGI-2 | 85
57.9 | HLE | 52.2
- | FrontierMath | 35.4
88.6 | SWE-bench Verified | -
VS
Price/Perf
GPT-5.2

OpenAI

40.9 | SWE-Bench Pro | 55.6
- | GPQA Diamond | 92.4
4 | ARC-AGI-2 | 54.2
25.1 | HLE | 45.5
2.1 | FrontierMath | 18.8
73.1 | SWE-bench Verified | 80
Gemma 4 31B

Google DeepMind

VS
Local Deploy
- | GPQA Diamond | 87.8
26.5 | HLE | 24
- | SWE-bench Verified | 77.2