OpenAI · Proprietary

GPT-5.1 Instant

GPT-5.1 Instant is a reasoning model developed by OpenAI. It features a long 400K context window and offers advanced reasoning capabilities.

Parameters

Undisclosed

Context

400K

License

Proprietary

Release date

2025-11-12

API pricing

Input price (per 1M tokens)

$1.25

Output price (per 1M tokens)

$

Billing mode: standard

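Strengths

  • Advanced reasoning capabilities
  • Broad context understanding with a 400K window
  • OpenAI's latest design

Weaknesses

  • Non-open-source license
  • Limited disclosure of specifications
  • Closed usage environment

Use cases

  • Tasks that require complex logical thinking
  • Long-document analysis
  • Problem solving that requires advanced reasoning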

In-depth analysis

Release Date

November 12, 2025

Context Window

128K tokens

Max Output

16K tokens

Input Price

$1.25 / 1M tokens

Output Price

$10.00 / 1M tokens

Cache Read

$0.13 / 1M tokens

Latency (P50 TTFT)

0.6s (OpenAI), 1.2s (Azure)

Throughput (P50)

102 TPS
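
The listed prices can be turned into a rough per-request cost estimate. The sketch below is illustrative only: the function name `estimate_cost` is hypothetical, and it assumes that cached tokens are the portion of the input billed at the cache-read rate instead of the input rate, which the listing does not state explicitly.

```python
# Prices in USD per 1M tokens, copied from the listing above.
INPUT_PRICE = 1.25
OUTPUT_PRICE = 10.00
CACHE_READ_PRICE = 0.13


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of a single request.

    Assumption: cached_tokens is the part of input_tokens served from the
    cache, billed at the cache-read rate rather than the input rate.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * INPUT_PRICE
        + cached_tokens * CACHE_READ_PRICE
        + output_tokens * OUTPUT_PRICE
    )
    return total / 1_000_000


if __name__ == "__main__":
    # 10K input tokens (8K of them cached) and 1K output tokens.
    print(f"${estimate_cost(10_000, 1_000, cached_tokens=8_000):.5f}")
```

Under these assumptions, output tokens dominate the bill for most chat-style requests, since they cost eight times as much as uncached input tokens.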

Strengths

  • Fastest model in the GPT-5.1 family with 0.6s P50 time-to-first-token
  • High throughput at 102 tokens per second for real-time applications
  • Supports tool use, vision, file input, reasoning, and web search
  • Available on both OpenAI and Azure with zero data retention support
  • Good for high-throughput backend APIs with many concurrent requests
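
The latency and throughput figures above combine into a simple back-of-the-envelope response-time estimate: time to first token plus generation time. This is a minimal sketch, not a benchmark. The name `estimate_latency` is hypothetical, the values are P50 figures so the result is a typical case rather than a guarantee, and it assumes the single listed throughput (102 TPS) applies to both providers.

```python
# P50 figures copied from the listing above.
TTFT_SECONDS = {"openai": 0.6, "azure": 1.2}
THROUGHPUT_TPS = 102.0  # assumption: same throughput on both providers


def estimate_latency(output_tokens: int, provider: str = "openai") -> float:
    """Estimate the typical seconds needed to receive a full response.

    Model: time-to-first-token + output_tokens / throughput.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_SECONDS[provider] + output_tokens / THROUGHPUT_TPS


if __name__ == "__main__":
    for provider in TTFT_SECONDS:
        seconds = estimate_latency(500, provider)
        print(f"{provider}: about {seconds:.1f}s for a 500-token reply")
```

For short replies the time to first token dominates, so the provider choice matters more there than for long generations.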

Weaknesses

  • Limited to 16K max output tokens — not suitable for long-form generation
  • Smaller context window (128K) compared to GPT-5.1 Thinking (410K)
  • Higher hallucination rate than Thinking variant due to reduced reasoning
  • Superseded by GPT-5.2 Chat (Instant) which offers better quality at lower cost
  • Same pricing as GPT-5.1 Thinking ($1.25/$10) despite reduced capabilities
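
The output and context limits above can be checked before sending a request. The following sketch is illustrative: `output_budget` is a hypothetical helper, "128K" and "16K" are read as 128,000 and 16,000 tokens, and it assumes the prompt and the generated output share the same context window. None of these details are confirmed by the listing.

```python
# Limits from the listing above; "K" is read as 1,000 (assumption).
CONTEXT_WINDOW = 128_000
MAX_OUTPUT = 16_000


def output_budget(prompt_tokens: int) -> int:
    """Return how many output tokens can still be requested.

    Assumption: prompt and output share the context window, so the budget
    is the smaller of the max-output cap and the remaining window.
    Returns 0 when the prompt alone fills or exceeds the window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    return max(0, min(MAX_OUTPUT, remaining))


if __name__ == "__main__":
    for prompt in (1_000, 120_000, 130_000):
        print(prompt, "->", output_budget(prompt))
```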

Competitor comparison

Model             Arena  SWE   GPQA  Price (input/output per 1M tokens)
Claude Haiku 4    ~1350  ~45%  ~72%  $0.25/$1.25
Gemini 3 Flash    ~1370  ~50%  ~78%  $0.15/$0.60
GPT-5.2 Instant   ~1400  ~60%  ~85%  $0.875/$7
GPT-5.1 Thinking  ~1400  ~74%  ~88%  $1.25/$10
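
To compare the prices above for a concrete workload, the per-1M-token rates can be applied to an expected token volume. This is a rough sketch using only the prices quoted on this page. The helper names `workload_cost` and `rank_by_cost` are hypothetical, and caching discounts and quality differences are ignored.

```python
# (input, output) prices in USD per 1M tokens, as quoted on this page.
PRICES = {
    "Claude Haiku 4": (0.25, 1.25),
    "Gemini 3 Flash": (0.15, 0.60),
    "GPT-5.2 Instant": (0.875, 7.00),
    "GPT-5.1 Thinking": (1.25, 10.00),
    "GPT-5.1 Instant": (1.25, 10.00),
}


def workload_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """USD cost of a workload on one model, ignoring cache discounts."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def rank_by_cost(input_tokens: int, output_tokens: int) -> list[str]:
    """Model names sorted from cheapest to most expensive for the workload."""
    return sorted(PRICES, key=lambda m: workload_cost(m, input_tokens, output_tokens))


if __name__ == "__main__":
    # Example workload: 50M input tokens and 5M output tokens.
    for name in rank_by_cost(50_000_000, 5_000_000):
        print(f"{name:<17} ${workload_cost(name, 50_000_000, 5_000_000):,.2f}")
```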

GPT-5.1 Instant is the fastest model in the GPT-5.1 family, optimized for low-latency responses across general-purpose tasks. Released November 12, 2025, it offers 0.6s time-to-first-token and 102 TPS throughput at $1.25/$10 per 1M tokens. It brings GPT-5.1 generation quality to real-time workloads, though it has been superseded by the cheaper and higher-quality GPT-5.2 Instant.

Analysis generated: 2026-05-24