모델 목록으로
Sakana AI오픈소스

Namazu-DeepSeek-V3.1-Terminus

An open-source LLM specialized for Japan, developed by Sakana AI. Based on DeepSeek-V3.1-Terminus, this model has been fine-tuned through post-training to correct biases to fit Japanese cultural and social contexts. It features significant improvements in neutrality and accuracy in themes related to politics, history, and diplomacy.

파라미터

685B (MoE)

컨텍스트

128K

라이선스

Apache 2.0

출시일

2026-03-15

일본어 처리 능력

🇯🇵Native JP

Model developed by a Japanese company or specialized for Japanese. Highest Japanese understanding and generation capability.

API 가격

이 모델의 API 가격 정보는 현재 공개되지 않았습니다

강점

  • Optimized for Japanese cultural context
  • Improved answer refusal rate from 72% to nearly 0%
  • Open-source (Apache 2.0)
  • Maintains inference performance equal to the base model

약점

  • Relies on the base model (only post-training applied)
  • Limited API availability
  • Few evaluations on global benchmarks
  • Still in alpha stage

활용 사례

  • Japanese chatbot
  • Q&A on political/historical topics
  • Content generation on Japanese culture/society
  • Education and research use

심층 분석

Base Model

DeepSeek-V3.1-Terminus

671B params, 37B active

Context Window

163,840 tokens

Extended from base 128K

Input Price

$0.27/1M tokens

via DeepInfra

Output Price

$0.95/1M tokens

via DeepInfra

Specialization

Japanese Cultural & Social Contexts

Corrected biases for Japan

License

MIT

Open-source, commercially permissive

강점

  • Specialized post-training corrects cultural and historical biases for Japanese contexts
  • Open-source MIT license allows commercial use and self-hosting
  • Significantly cheaper than comparable Western frontier models
  • Maintains strong coding and agentic capabilities from DeepSeek-V3.1-Terminus base
  • Extended 163K context window from some providers

약점

  • Limited to no global benchmark data; performance in non-Japanese contexts unverified
  • Dependent on DeepSeek V3.1-Terminus base, which trails newer DeepSeek V4 models
  • Regional focus may limit multilingual versatility outside Japanese use cases
  • Potential for residual biases or inaccuracies in specialized domains
  • Community and ecosystem much smaller than major model families

경쟁사 비교

ModelArenaSWEGPQAPrice
DeepSeek-V3.1-Terminus (Original)N/A68.4%80.7%$0.27/$0.95 (per 1M tokens)
DeepSeek-V4 ProN/A~68.5% (Verified)Higher than V3.1-T$0.30/$0.50 (per 1M tokens)
Sakana's Japanese-optimized LLM (Hypothetical)N/AN/AN/ANot publicly listed

Namazu-DeepSeek-V3.1-Terminus is a specialized open-source large language model developed by Sakana AI, tailored specifically for Japanese linguistic and cultural contexts. Built upon the DeepSeek-V3.1-Terminus architecture (671B total parameters, 37B active per token), it undergoes targeted post-training to correct inherent biases and improve neutrality in sensitive areas such as politics, history, and diplomacy relevant to Japan. The model aims to provide more accurate and contextually appropriate responses for Japanese users and applications, addressing a critical gap where general-purpose models may fall short.

The model inherits the strong technical foundation of DeepSeek-V3.1-Terminus, including its hybrid reasoning capabilities (thinking/non-thinking modes), support for structured tool calling, and competitive performance in coding and agentic benchmarks like SWE-Bench. However, its primary value proposition lies in its specialized alignment rather than raw benchmark dominance. It is positioned as a niche yet essential tool for developers and organizations requiring an AI that deeply understands Japanese societal norms and avoids Western-centric biases.

From an operational standpoint, Namazu is deployed via API through infrastructure partners like DeepInfra at a highly competitive price point, undercutting most Western frontier models by an order of magnitude. Its open-source MIT license further enhances its accessibility for local deployment and customization. While it represents a significant step forward for Japan-focused AI, its performance on global benchmarks remains unpublicized, and it operates in a highly specialized segment of the market.

분석 생성일: 2026-05-23