China’s Coding Model, Kimi K2.7 Code, is 6x Cheaper Than Claude. It also Grades Its Own Homework

· Source: Towards AI - Medium · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Advanced, quick

Summary

Moonshot AI released Kimi K2.7 Code on June 12, 2026, an open-weight, coding-specialized model priced at \$0.95 per million input tokens and \$4.00 per million output tokens. This makes it approximately six times cheaper than Claude Opus 4.8, which costs \$5 and \$25 for input and output respectively. Kimi K2.7 Code features a 256K context window and a trillion parameters using a mixture-of-experts architecture. However, all benchmarks published for K2.7 Code, including Kimi Code Bench v2 and Program Bench, are proprietary and lack independent verification on public leaderboards like SWE-bench Verified. Furthermore, on its own in-house scorecards, K2.7 Code loses to GPT-5.5 on all six metrics and to Claude Opus 4.8 on five of six.

Key takeaway

For AI Engineers evaluating coding models, you should critically assess performance claims, especially when cost savings are significant. While Kimi K2.7 Code offers a compelling price point at \$4.00 per million output tokens, its reliance on unverified, in-house benchmarks and underperformance against competitors on those same metrics suggests caution. Prioritize models with transparent, publicly verifiable benchmarks to ensure you are not compromising code quality for cost efficiency.

Key insights

Kimi K2.7 Code offers extreme cost efficiency but lacks independent performance verification, raising trust concerns.

Principles

In practice

Topics

Best for: NLP Engineer, CTO, VP of Engineering/Data, AI Engineer, Machine Learning Engineer, AI Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Towards AI - Medium.