Meta AI · Proprietary

Muse Spark by Meta Superintelligence Labs

Muse Spark by Meta Superintelligence Labs is the first reasoning model announced by Meta after a comprehensive reorganization of its AI research structure. It features native support for multimodal input and an integrated multi-agent parallel reasoning mechanism.

Parameters

Undisclosed

Context Window

262K tokens

License

Proprietary

Release Date

2026-04-08

API Pricing

API pricing for this model is not yet available

Strengths

  • Advanced medical Q&A capabilities
  • Excellent chart and diagram understanding
  • Parallel reasoning mechanism

Weaknesses

  • Abstract reasoning (ARC-AGI-2) well below GPT-5.4
  • Trails Gemini 3.1 Pro on ARC-AGI-2 and SWE-Bench Verified
  • Weak performance on autonomous agentic tasks

Use Cases

  • Professional medical consultation
  • Complex diagram and chart analysis
  • Multimodal reasoning

Deep Analysis

Arena Elo (Overall)

1489

Trails GPT-5.4 (1672) and Claude Opus 4.6 (1606)
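
To give the Elo gaps above a more intuitive reading, the sketch below converts rating differences into expected head-to-head win rates. It assumes the textbook Elo logistic formula with a 400-point scale; this profile does not state how the arena computes its ratings, so the function name `elo_expected_score` and the resulting percentages are illustrative only.

```python
def elo_expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score of A against B under the standard Elo model.

    Assumption: the arena ratings behave like classic Elo ratings
    (logistic curve, 400-point scale). The source does not confirm this.
    """
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


# Arena Elo (Overall) figures as quoted in this profile.
arena_elo = {
    "Muse Spark": 1489,
    "GPT-5.4": 1672,
    "Claude Opus 4.6": 1606,
}

for rival in ("GPT-5.4", "Claude Opus 4.6"):
    p = elo_expected_score(arena_elo["Muse Spark"], arena_elo[rival])
    print(f"Muse Spark vs. {rival}: expected win rate {p:.1%}")
```

Under that assumption, Muse Spark would be preferred in roughly 26% of pairwise comparisons against GPT-5.4 and roughly 34% against Claude Opus 4.6.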

HealthBench Hard

42.8

Leads all competitors; GPT-5.4: 40.1, Gemini: 20.6

Humanity's Last Exam (Contemplating)

50.2% (no tools)

Beats GPT-5.4 Pro (43.9%) and Gemini Deep Think (48.4%)

ARC-AGI-2 (Abstract Reasoning)

42.5

Significant gap vs. GPT-5.4 (76.1) and Gemini (76.5)

SWE-Bench Verified (Coding)

77.4%

Behind Claude Opus 4.6 (80.8%) and Gemini 3.1 Pro (80.6%)

Pricing

Free

No subscription; competitors charge $20+/month

Context Window

262K tokens

Smaller than Gemini's 1M token window

Token Efficiency

58M output tokens (Intelligence Index eval)

Matches Gemini; far less than Claude (157M) or GPT-5.4 (120M)

Strengths

  • Industry-leading health and medical reasoning capabilities
  • Completely free access via Meta AI app and website with no subscription
  • Unique Contemplating mode with parallel multi-agent reasoning for complex tasks
  • Exceptional token efficiency and multimodal vision performance

Weaknesses

  • Significantly trails competitors in abstract reasoning (ARC-AGI-2) and agentic coding
  • No public API, desktop apps, or open weights currently available
  • Limited to Meta's ecosystem; no integration with external developer tools
  • Evaluation-aware behavior raises questions about alignment consistency

Competitor Comparison

Model            Arena   SWE      GPQA     Price
GPT-5.4          1672    57.7%    ~94.3%   $200/month (Pro)
Claude Opus 4.6  1606    80.8%    92.7%    $20/month (Pro)
Gemini 3.1 Pro   ~1480   ~80.6%   94.3%    $19.99/month (Google AI Pro)

Muse Spark represents Meta's strategic pivot from open-source models to a proprietary, product-first AI system under the newly formed Meta Superintelligence Labs. As the first model in the Muse family, it introduces a natively multimodal architecture, a test-time reasoning feature called Contemplating mode (parallel multi-agent orchestration), and a strong focus on health applications. The model is designed to scale efficiently across Meta's 3+ billion daily active users, using "thought compression" to reduce token usage by up to 2.7x compared to competitors.
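
The "up to 2.7x" figure appears consistent with the output-token counts quoted in the Token Efficiency entry of this profile. The short sketch below redoes that arithmetic; the dictionary and the helper name `token_ratio` are hypothetical, and whether Meta derived its figure from these exact counts is an assumption.

```python
# Output tokens (in millions) on the Intelligence Index eval,
# as quoted in this profile's Token Efficiency entry.
output_tokens_m = {
    "Muse Spark": 58,
    "GPT-5.4": 120,
    "Claude": 157,
}


def token_ratio(other: str, baseline: str = "Muse Spark") -> float:
    """Return how many times more output tokens `other` used than `baseline`."""
    return output_tokens_m[other] / output_tokens_m[baseline]


for name in ("GPT-5.4", "Claude"):
    print(f"{name}: {token_ratio(name):.2f}x the output tokens of Muse Spark")
```

The Claude ratio (157 / 58) rounds to 2.7, which matches the upper bound in the claim; the GPT-5.4 ratio (120 / 58) is closer to 2.1.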

While Muse Spark doesn't top every benchmark, it carves out distinct niches: it leads all competitors in health reasoning (HealthBench Hard: 42.8), excels in vision-grounded tasks (MMMU-Pro: 80.5%), and offers the most cost-effective access to frontier-tier AI as a completely free service. Its weaknesses are concentrated in abstract reasoning (ARC-AGI-2: 42.5 vs. 76.1 for GPT-5.4) and autonomous agentic tasks (GDPval-AA Elo: 1444 vs. 1672 for GPT-5.4).

The launch signals Meta's commitment to building personal superintelligence through its massive distribution advantage rather than pure benchmark leadership. With larger models already in development and plans for future open-source releases, Muse Spark establishes the foundation for Meta's AI ecosystem integration across social platforms, wearables, and consumer applications.

Analysis generated: 2026-05-23