모델 목록으로
DeepMind오픈소스

Gemma 4 26B A4B(混合专家模型)

Gemma 4 26B A4B는 DeepMind가 개발한 기반 모델로, 전문가 혼합(MoE) 아키텍처를 채택했습니다. 약 25.2B 파라미터를 갖추고 채팅 형식의 상호작용에 최적화되었습니다.

파라미터

25.2B

컨텍스트

256K

라이선스

Apache 2.0

출시일

2026-04

API 가격

이 모델의 API 가격 정보는 현재 공개되지 않았습니다

강점

  • MoE를 통한 효율적인 추론
  • 256K 긴 컨텍스트 윈도우
  • 오픈 Apache 2.0 라이선스

약점

  • 특정 작업에 대한 전문성 부족
  • 대형 모델 대비 성능 격차
  • 운영 비용 변동 가능성

활용 사례

  • 고급 챗봇 개발
  • 장문 문서 분석
  • 오픈 소스 AI 구축

심층 분석

Arena Elo (Text Overall)

1438

From BenchLM (7,777 votes)

Arena Elo (Coding)

1481

Strong coding-specific performance

GPQA Diamond (Reasoning)

82.3%

vs Gemma 4 31B: 84.3%

AIME 2026 (Math)

88.3%

High math-reasoning capability

Active Parameters

~3.8B

Out of 25.2B total (MoE)

Input/Output Price

$0.13 / $0.40 per 1M tokens

Blended ~$0.20/M

강점

  • Exceptional efficiency: MoE architecture with only ~3.8B active parameters delivers performance rivaling much larger dense models.
  • Strong mathematical and coding reasoning, with high AIME and LiveCodeBench scores.
  • Apache 2.0 license simplifies commercial adoption and deployment compared to previous Gemma versions.

약점

  • Overall Arena Elo (1438) lags behind top proprietary models (e.g., Gemini 3.1 Pro) and even its dense 31B sibling.
  • Agentic performance is a noted weakness, with low scores on benchmarks like Terminal-Bench (13.6) and HLE (8.7).
  • Lacks native audio support found in smaller Gemma 4 E2B/E4B models, limiting some multimodal use cases.

경쟁사 비교

ModelArenaSWEGPQAPrice
Gemini 3.1 Pro (Google)~1480+N/A~92%+Proprietary/Subscription
Gemma 4 31B Dense (Google)145280.0%*84.3%$0.13/$0.40 per 1M tokens
Llama 4 (Meta)~1460*N/A~89%*Open Weight

Gemma 4 26B A4B is Google DeepMind's efficiency-focused open-weight model released in April 2026. As a Mixture-of-Experts (MoE) model, its defining characteristic is achieving near-frontier performance with minimal active parameters (~3.8B), making it highly cost-effective for inference while handling substantial 256K context windows. This positions it as a compelling "sweet spot" for developers and researchers who need strong capabilities—particularly in reasoning, coding, and multimodal tasks—without the computational demands of its larger 31B dense sibling or the resource requirements of proprietary frontier models.

The model excels in structured reasoning tasks, demonstrated by top-tier scores on benchmarks like AIME (math) and LiveCodeBench (coding). Its release under the permissive Apache 2.0 license marks a significant shift from earlier Gemma models, greatly simplifying commercial and private deployment. While it may not surpass the absolute best proprietary models in every benchmark, its combination of strong performance, architectural efficiency, and open licensing makes it a highly practical choice for a wide range of applications, from on-device assistive agents to large-scale enterprise pipelines.

분석 생성일: 2026-05-23