China’s Coding Model, Kimi K2.7 Code, is 6x Cheaper Than Claude. It also Grades Its Own Homework
Summary
Moonshot AI released Kimi K2.7 Code on June 12, 2026, an open-weight, coding-specialized model priced at \$0.95 per million input tokens and \$4.00 per million output tokens. This makes it approximately six times cheaper than Claude Opus 4.8, which costs \$5 and \$25 for input and output respectively. Kimi K2.7 Code features a 256K context window and a trillion parameters using a mixture-of-experts architecture. However, all benchmarks published for K2.7 Code, including Kimi Code Bench v2 and Program Bench, are proprietary and lack independent verification on public leaderboards like SWE-bench Verified. Furthermore, on its own in-house scorecards, K2.7 Code loses to GPT-5.5 on all six metrics and to Claude Opus 4.8 on five of six.
Key takeaway
For AI Engineers evaluating coding models, you should critically assess performance claims, especially when cost savings are significant. While Kimi K2.7 Code offers a compelling price point at \$4.00 per million output tokens, its reliance on unverified, in-house benchmarks and underperformance against competitors on those same metrics suggests caution. Prioritize models with transparent, publicly verifiable benchmarks to ensure you are not compromising code quality for cost efficiency.
Key insights
Kimi K2.7 Code offers extreme cost efficiency but lacks independent performance verification, raising trust concerns.
Principles
- Cost-efficiency can mask performance gaps.
- Proprietary benchmarks hinder trust.
- Open-weight models challenge market pricing.
In practice
- Evaluate models beyond stated costs.
- Prioritize models with public benchmarks.
- Scrutinize vendor-specific performance claims.
Topics
- Kimi K2.7 Code
- Code Generation
- Model Benchmarking
- Cost Efficiency
- Open-weight Models
- Mixture-of-Experts
Best for: NLP Engineer, CTO, VP of Engineering/Data, AI Engineer, Machine Learning Engineer, AI Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Towards AI - Medium.