Developed post-training technology to adapt largest-scale open foundation models to national specifications.
Summary
Sakana AI has developed and released the "Namazu" series, an alpha version of prototype models that adapt existing frontier open-weight foundation models to Japanese cultural values and security requirements through post-training technology. Concurrently, the company launched "Sakana Chat," a chat service featuring the Namazu models, which include Namazu-DeepSeek-V3.1-Terminus, Llama-3.1-Namazu-405B, and Namazu-gpt-oss-120B. The Namazu series maintains performance comparable to its base models across major benchmarks like AIME’25 and MMLU-Redux, while significantly improving neutrality and factual accuracy on politically sensitive topics related to Japan. It also reduces refusal rates on such questions from 72% to nearly 0% for models like DeepSeek-V3.1-Terminus. Sakana Chat integrates a Web search function to provide real-time information.
Key takeaway
For NLP Engineers developing LLM applications for specific national or cultural markets, Sakana AI's Namazu series demonstrates that post-training can effectively mitigate inherent biases and censorship tendencies of global foundation models. You should consider implementing similar post-training methodologies to ensure your models provide neutral, accurate, and culturally appropriate responses, especially for sensitive topics, without compromising core performance. This approach can significantly enhance user trust and regulatory compliance in target regions.
Key insights
Post-training adapts frontier LLMs to specific cultural and security contexts while preserving performance.
Principles
- Open-source LLMs require post-training for regional bias correction.
- Cultural adaptation can reduce model refusal rates on sensitive topics.
Method
Proprietary post-training technology uses unique datasets for bias correction within specific cultural contexts, applied to high-performance open-weight foundation models.
In practice
- Use post-training to align LLMs with local cultural norms.
- Integrate Web search for real-time information in chat services.
Topics
- Post-Training Technology
- Foundation Model Adaptation
- LLM Bias Correction
- Country-Specific AI
- Sakana Chat
Best for: NLP Engineer, AI Engineer, Machine Learning Engineer, AI Product Manager
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Blog.