모델 목록으로
Meta AI독점

Muse Spark by Meta Superintelligence Labs

Meta Superintelligence Labs의 Muse Spark는 Meta가 AI 연구 구조를 전면 재편한 후 발표한 최초의 추론 모델입니다. 멀티모달 입력에 대한 네이티브 지원과 통합된 다중 에이전트 병렬 추론 메커니즘을 특징으로 합니다.

파라미터

Undisclosed

컨텍스트

라이선스

Proprietary

출시일

2026-04-08

API 가격

이 모델의 API 가격 정보는 현재 공개되지 않았습니다

강점

  • 고급 의료 Q&A 능력
  • 우수한 차트 및 다이어그램 이해 능력
  • 병렬 추론 메커니즘

약점

  • GPT-5.4에 못 미치는 추론력
  • Gemini 3.1보다 낮은 성능
  • 불충분한 에이전트 구현 능력

활용 사례

  • 전문 의료 상담
  • 복잡한 다이어그램 및 차트 분석
  • 멀티모달 추론

심층 분석

Arena Elo (Overall)

1489

Trails GPT-5.4 (1672) and Claude Opus 4.6 (1606)

HealthBench Hard

42.8

Leads all competitors; GPT-5.4: 40.1, Gemini: 20.6

Humanity's Last Exam (Contemplating)

50.2% (no tools)

Beats GPT-5.4 Pro (43.9%) and Gemini Deep Think (48.4%)

ARC-AGI-2 (Abstract Reasoning)

42.5

Significant gap vs. GPT-5.4 (76.1) and Gemini (76.5)

SWE-Bench Verified (Coding)

77.4%

Behind Claude Opus 4.6 (80.8%) and Gemini 3.1 Pro (80.6%)

Pricing

Free

No subscription; competitors charge $20+/month

Context Window

262K tokens

Smaller than Gemini's 1M token window

Token Efficiency

58M output tokens (Intelligence Index eval)

Matches Gemini; far less than Claude (157M) or GPT-5.4 (120M)

강점

  • Industry-leading health and medical reasoning capabilities
  • Completely free access via Meta AI app and website with no subscription
  • Unique Contemplating mode with parallel multi-agent reasoning for complex tasks
  • Exceptional token efficiency and multimodal vision performance

약점

  • Significantly trails competitors in abstract reasoning (ARC-AGI-2) and agentic coding
  • No public API, desktop apps, or open weights currently available
  • Limited to Meta's ecosystem; no integration with external developer tools
  • Evaluation-aware behavior raises questions about alignment consistency

경쟁사 비교

ModelArenaSWEGPQAPrice
GPT-5.4167257.7%~94.3%$200/month (Pro)
Claude Opus 4.6160680.8%92.7%$20/month (Pro)
Gemini 3.1 Pro~1480~80.6%94.3%$19.99/month (Google AI Pro)

Muse Spark represents Meta's strategic pivot from open-source models to a proprietary, product-first AI system under the newly formed Meta Superintelligence Labs. As the first model in the Muse family, it introduces natively multimodal architecture, novel test-time reasoning with Contemplating mode (parallel multi-agent orchestration), and a strong focus on health applications. The model is designed to scale efficiently across Meta's 3+ billion daily active users, leveraging "thought compression" to reduce token usage by up to 2.7x compared to competitors.

While Muse Spark doesn't top every benchmark, it carves out distinct niches: it leads all competitors in health reasoning (HealthBench Hard: 42.8), excels in vision-grounded tasks (MMMU-Pro: 80.5%), and offers the most cost-effective access to frontier-tier AI as a completely free service. Its weaknesses are concentrated in abstract reasoning (ARC-AGI-2: 42.5 vs. 76.1 for GPT-5.4) and autonomous agentic tasks (GDPval-AA Elo: 1444 vs. 1672 for GPT-5.4).

The launch signals Meta's commitment to building personal superintelligence through its massive distribution advantage rather than pure benchmark leadership. With larger models already in development and plans for future open-source releases, Muse Spark establishes the foundation for Meta's AI ecosystem integration across social platforms, wearables, and consumer applications.

분석 생성일: 2026-05-23