모델 목록으로
Anthropic독점

Claude Opus 4.7

Anthropic's high-performance chat model. Based on the Mythos architecture, it is optimized for conversational tasks. Equipped with the Managed Agents function, it can handle complex business workflow automation.

파라미터

Undisclosed

컨텍스트

200K

라이선스

Proprietary

출시일

2026-04-16

일본어 처리 능력

High-Quality JP

Multilingual model with strong Japanese language processing capabilities.

API 가격

입력 가격 (1M 토큰당)

$15

출력 가격 (1M 토큰당)

$75

과금 모드: standard

강점

  • Business automation with Managed Agents
  • Cost reduction via caching
  • High conversation quality
  • 50% discount with Batch API

약점

  • Standard pricing is among the highest
  • Not open-source
  • Lower reasoning performance than dedicated reasoning models

활용 사례

  • Business process automation
  • Chatbots and dialogue systems
  • Customer support AI
  • Tasks requiring long-term dialogue context

심층 분석

GDPval-AA Elo

1,753

#1 overall, +79 Elo over nearest competitors

SWE-bench Verified

87.6%

vs GPT-5.4: N/A, Gemini 3.1 Pro: 80.6%

SWE-bench Pro

64.3%

vs GPT-5.4: 57.7%, Gemini 3.1 Pro: 54.2%

MCP-Atlas (Tool Use)

77.3%

#1 among all available models

GPQA Diamond

94.2%

vs GPT-5.4 Pro: 94.4%, Gemini 3.1 Pro: 94.3%

Input / Output Price

$5 / $25 per 1M tokens

new tokenizer inflates token count up to 35%

강점

  • Best-in-class agentic coding with SWE-bench Pro at 64.3% (+10.9pp over Opus 4.6) and self-verification catching errors before reporting
  • Leading tool use orchestration at 77.3% MCP-Atlas with task budgets enabling controlled long-running agent workflows
  • 3.75 megapixel vision (3.3× prior Claude) with dramatic CharXiv reasoning gains (+13pp without tools)

약점

  • Long-context retrieval collapsed: MRCR v2 8-needle at 1M tokens dropped from 78.3% to 32.2% vs Opus 4.6
  • BrowseComp web research regressed from 83.7% to 79.3%, trailing GPT-5.5 (84.4%) and Gemini 3.1 Pro (85.9%)
  • New tokenizer inflates token counts by 1.0–1.35× on same input, effectively raising per-task API costs up to 35%

경쟁사 비교

ModelArenaGPQAPrice
GPT-5.5Tied (#1 on AI Index)93.6%$5/$30
Gemini 3.1 ProTied (#1 on AI Index)94.3%$2/$12
Claude Opus 4.6#491.3%$5/$25

Claude Opus 4.7, released April 16, 2026, is Anthropic's most capable generally available model and the first to ship with production cybersecurity safeguards developed under Project Glasswing. Built on the Mythos architecture, it ties with GPT-5.5 and Gemini 3.1 Pro atop the Artificial Analysis Intelligence Index (score 57) while leading GDPval-AA—a benchmark measuring economically valuable knowledge work across 44 occupations—by 79 Elo points. The model represents a targeted upgrade over Opus 4.6, with improvements concentrated in agentic coding (+10.9pp on SWE-bench Pro), multi-tool orchestration (77.3% MCP-Atlas, #1 among available models), and visual reasoning (+13pp on CharXiv). A new self-verification capability causes the model to check its own work before reporting, reducing confident-but-wrong outputs and enabling more autonomous long-running workflows.

However, the release comes with real trade-offs. Long-context retrieval performance regressed sharply—MRCR v2 8-needle at 1M tokens dropped from 78.3% to 32.2%—and BrowseComp web research fell 4.4 points, trailing both GPT-5.5 and Gemini 3.1 Pro. A new tokenizer inflates token counts by up to 35% on identical inputs, meaning effective per-task costs rise despite unchanged per-token pricing. Anthropic also deliberately reduced Opus 4.7's cybersecurity capabilities during training, making it the first commercially available model intentionally constrained in a specific domain for safety reasons. This positions it as a bridge to the more powerful but restricted Claude Mythos Preview.

The pricing remains at $5/$25 per 1M input/output tokens (with 90% cache discounts and 50% batch discounts available), and the model maintains the 1M-token context window and 128K max output of its predecessor. New features include an 'xhigh' effort level for finer reasoning control, task budgets in public beta for token-guided agentic loops, and vision resolution increased to 3.75 megapixels. Developer reception is strongly positive for coding and agent workflows, though the long-context regression and tokenizer cost increase have drawn sharp criticism in the community.

분석 생성일: 2026-05-23