Meituan open sources LongCat-2.0, the 1.6T, near-frontier agentic coding model that's been leading OpenRouter — trained entirely on Chinese chips

· Source: VentureBeat · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Emerging Technologies & Innovation · Depth: Expert, medium

Summary

Meituan has open-sourced LongCat-2.0, a 1.6-trillion-parameter Mixture-of-Experts (MoE) agentic coding model, under an MIT license. This model, previously known as "Owl Alpha" and a top performer on OpenRouter, features a native 1-million-token context window. Crucially, LongCat-2.0 was trained entirely on a cluster of over 50,000 domestic Chinese ASICs, demonstrating the capability to scale near-frontier AI without reliance on U.S. Nvidia GPUs. Its commercial framework includes aggressive pricing, with a limited-time promotional rate of \$0.30 per million input tokens and \$1.20 for output, and uniquely offers free processing for context-cache hits. The model excels in software engineering tasks, scoring 59.5 on SWE-bench Pro, surpassing GPT-5.5's 58.6, and utilizes a Multi-Teacher Optimization via Mixture of Specialized Experts (MOPD) post-training framework for specialized agentic, reasoning, and interaction capabilities.

Key takeaway

For AI Engineers and Directors of AI/ML evaluating large language models for autonomous software engineering, LongCat-2.0 presents a compelling, cost-effective alternative. Its 1-million-token context window and zero-cost cache hits significantly reduce operational expenses for iterative coding tasks. You should consider integrating this MIT-licensed model to bypass data privacy concerns and high API costs of proprietary systems, especially for codebase migrations or continuous infrastructure optimization.

Key insights

Meituan's LongCat-2.0 demonstrates near-frontier AI can be developed on domestic Chinese ASICs with competitive performance and cost.

Principles

Method

LongCat-2.0 uses LongCat Sparse Attention (LSA) with Streaming-aware, Cross-Layer, and Hierarchical Indexing for 1M-token context. Post-training employs MOPD, segregating Agent, Reasoning, and Interaction Experts.

In practice

Topics

Code references

Best for: CTO, VP of Engineering/Data, Entrepreneur, AI Engineer, Machine Learning Engineer, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by VentureBeat.