모델 목록으로
Google DeepMind조건부 오픈

Gemma 4 31B

The latest version of Google DeepMind's lightweight open model. With 31B parameters, it provides efficient performance, and commercial use is possible under the Gemma license (with conditions). It is practical for operation in local environments, making it suitable for use in settings with strict privacy requirements.

파라미터

31B

컨텍스트

128K

라이선스

Gemma License

출시일

2026-04-06

일본어 처리 능력

High-Quality JP

Multilingual model with strong Japanese language processing capabilities.

API 가격

이 모델의 API 가격 정보는 현재 공개되지 않았습니다

강점

  • Lightweight, suitable for local operation
  • Commercial use allowed under Gemma License (conditional)
  • Based on Google's technology
  • Active community

약점

  • Gemma License includes non-commercial restrictions
  • Limited Japanese capabilities
  • Large performance gap vs frontier models
  • No API available (self-hosted only)

활용 사례

  • AI utilization in local environments
  • Privacy-focused applications
  • Model fine-tuning
  • Research and experimental use

심층 분석

Arena Elo

1451

Overall text rank #3 among open models

GPQA Diamond

84.3%

Strong scientific reasoning

LiveCodeBench v6

80.0%

Excellent coding performance

Input Price

$0.14/1M

Via API providers, free to self-host

Context Window

256K tokens

Effective long-context performance

Parameters

31B dense

Active for every inference step

강점

  • Outstanding reasoning and coding benchmarks for a 31B parameter model
  • Apache 2.0 license allows unrestricted commercial use and fine-tuning
  • Strong multimodal capabilities with native thinking/reasoning mode

약점

  • Slower inference speed (6-8 tok/s locally) compared to MoE models
  • No audio support (only text and image modalities)
  • Requires significant VRAM (20GB+ for quantized, 34GB+ for 8-bit)

경쟁사 비교

ModelArenaSWEGPQAPrice
Claude 3.5 Sonnet~1500 (estimated)79.6%~85% (estimated)$3/$15 per 1M tokens
Llama 4 Maverick (400B)N/AN/AN/AFree self-host, $0 API via providers
Mistral Large 2~1480 (estimated)~80% (estimated)~82% (estimated)$2/$6 per 1M tokens

Gemma 4 31B represents Google DeepMind's flagship open-weight model released April 2, 2026, delivering frontier-level performance in a 31B dense architecture. The model achieves remarkable benchmark scores—89.2% on AIME 2026 math, 80% on LiveCodeBench v6 coding, and 84.3% on GPQA Diamond scientific reasoning—marking generational improvements over Gemma 3 (which scored 20.8% on AIME). Under Apache 2.0 licensing, it enables unrestricted commercial use, fine-tuning, and deployment without MAU restrictions.

Designed for developers needing strong reasoning, coding, and multimodal capabilities, the model supports text and image inputs with configurable chain-of-thought reasoning. While its dense architecture makes it slower than Mixture-of-Experts alternatives (6-8 tok/s locally vs 50+ tok/s for Gemma 4 26B A4B MoE), it offers the highest quality ceiling within the Gemma family. The 256K context window with improved retrieval reliability (66.4% on multi-needle tests) enables practical long-document processing.

The model positions itself as the premier open-weight alternative to commercial API models, particularly for teams requiring data sovereignty, custom fine-tuning, or cost-controlled self-hosting. Recent community adoption shows strong interest in coding assistance, research applications, and agentic workflows, though hardware requirements limit casual local use.

분석 생성일: 2026-05-23