GPT-5.1 Instant
GPT-5.1 Instant은 OpenAI에서 개발한 추론 모델입니다. 400K의 긴 컨텍스트 윈도우를 특징으로 하며, 고급 추론 능력을 제공합니다.
파라미터
Undisclosed
컨텍스트
400K
라이선스
Proprietary
출시일
2025-11-12
API 가격
입력 가격 (1M 토큰당)
$1.25
출력 가격 (1M 토큰당)
$
과금 모드: standard
강점
- ・고급 추론 능력 제공
- ・광범위한 400K 컨텍스트 이해
- ・OpenAI의 최신 설계
약점
- ・비오픈소스 라이선스
- ・제한된 사양 공개
- ・폐쇄적 사용 환경
활용 사례
- ・복잡한 논리적 사고가 필요한 작업
- ・긴 문서 분석
- ・고급 추론이 필요한 문제 해결
심층 분석
Release Date
November 12, 2025
Context Window
128K tokens
Max Output
16K tokens
Input Price
$1.25 / 1M tokens
Output Price
$10.00 / 1M tokens
Cache Read
$0.13 / 1M tokens
Latency (P50 TTFT)
0.6s (OpenAI), 1.2s (Azure)
Throughput (P50)
102 TPS
강점
- ・Fastest model in the GPT-5.1 family with 0.6s P50 time-to-first-token
- ・High throughput at 102 tokens per second for real-time applications
- ・Supports tool use, vision, file input, reasoning, and web search
- ・Available on both OpenAI and Azure with zero data retention support
- ・Good for high-throughput backend APIs with many concurrent requests
약점
- ・Limited to 16K max output tokens — not suitable for long-form generation
- ・Smaller context window (128K) compared to GPT-5.1 Thinking (410K)
- ・Higher hallucination rate than Thinking variant due to reduced reasoning
- ・Superseded by GPT-5.2 Chat (Instant) which offers better quality at lower cost
- ・Same pricing as GPT-5.1 Thinking ($1.25/$10) despite reduced capabilities
경쟁사 비교
| Model | Arena | SWE | GPQA | Price |
|---|---|---|---|---|
| Claude Haiku 4 | ~1350 | ~45% | ~72% | $0.25/$1.25 per 1M tokens |
| Gemini 3 Flash | ~1370 | ~50% | ~78% | $0.15/$0.60 per 1M tokens |
| GPT-5.2 Instant | ~1400 | ~60% | ~85% | $0.875/$7 per 1M tokens |
| GPT-5.1 Thinking | ~1400 | ~74% | ~88% | $1.25/$10 per 1M tokens |
GPT-5.1 Instant is the fastest model in the GPT-5.1 family, optimized for low-latency responses across general-purpose tasks. Released November 12, 2025, it offers 0.6s time-to-first-token and 102 TPS throughput at $1.25/$10 per 1M tokens. It brings GPT-5.1 generation quality to real-time workloads, though it has been superseded by the cheaper and higher-quality GPT-5.2 Instant.
출처
분석 생성일: 2026-05-24