모델 목록으로
OpenAI독점

OpenAI GPT-5.1-Codex-Max

OpenAI GPT-5.1-Codex-Max는 OpenAI에서 개발한 프로그래밍 전문 기반 모델입니다. 매우 긴 400K의 컨텍스트 윈도우를 특징으로 하여 대규모 코드베이스 처리에 적합합니다.

파라미터

Undisclosed

컨텍스트

400K

라이선스

Proprietary

출시일

2025-11-19

API 가격

이 모델의 API 가격 정보는 현재 공개되지 않았습니다

강점

  • 고급 코딩 기능
  • 대규모 400K 컨텍스트 윈도우
  • OpenAI에 의해 최적화됨

약점

  • 비공개 소스 라이선스에 의해 제한됨
  • 외부 접근 제한됨
  • 폐쇄적 사용 조건

활용 사례

  • 대규모 코드베이스 분석
  • 복잡한 프로그램 구현
  • 고급 버그 수정

심층 분석

Release Date

November 19, 2025

Context Window

Effectively unlimited (compaction)

Input Price

$1.25 / 1M tokens

Output Price

$10.00 / 1M tokens

Cached Input

$0.625 / 1M tokens

SWE-bench Verified

77.9% (xhigh)

Terminal-Bench 2.0

58.1%

SWE-Lancer IC SWE

79.9%

Autonomous Operation

24+ hours continuous

Throughput

58.4 tok/s avg (11-110 range)

강점

  • First model with context compaction — effectively unlimited context through iterative summarization
  • SWE-bench Verified 77.9% with 30% fewer thinking tokens than predecessor
  • Autonomous operation for 24+ hours on complex tasks
  • Native Windows support — first OpenAI coding model to offer this
  • Configurable reasoning effort (none/medium/high/xhigh) for cost/quality tradeoffs

약점

  • High latency (2,060ms avg TTFT) with significant variability (169.3% CV)
  • Context compaction can 'blur' details over very long sessions
  • METR evaluation suggests 80% reliability time-horizon is ~2 hours, not 24
  • Follows instructions very literally — may not recognize obvious typos
  • Higher code churn compared to Claude Code (30% more reworks)

경쟁사 비교

ModelArenaSWEGPQAPrice
Claude Opus 4.5~145080.9%~92%$15/$75 per 1M tokens
Gemini 3 Pro~142076.2%~90%$3.50/$10.50 per 1M tokens
GPT-5.1 Codex~140073.7%~88%$1.25/$10 per 1M tokens
Cursor (varies)N/AVariesN/A$20/month subscription

GPT-5.1-Codex-Max is OpenAI's frontier agentic coding model released November 19, 2025, featuring revolutionary context compaction technology for effectively unlimited context. It achieves SWE-bench Verified 77.9% with 30% fewer thinking tokens than its predecessor and can operate autonomously for over 24 hours. It replaced GPT-5.1-Codex as the default across all Codex surfaces.

분석 생성일: 2026-05-24