Cursor's Composer 2.5 matches Opus 4.7 and GPT-5.5 benchmarks at a fraction of the cost
Summary
Cursor has released Composer 2.5, a significant update to its in-house AI coding model, building on Moonshot's open-source Kimi K2.5 checkpoint. This new iteration was trained with 25 times more synthetic tasks than its predecessor, Composer 2, allocating 85 percent of its compute budget to additional training and reinforcement learning. Composer 2.5 achieves benchmark scores of 79.8 percent on SWE-Bench Multilingual and 63.2 percent on CursorBench v3.1, matching the performance of Opus 4.7 and GPT-5.5. Notably, its pricing is substantially lower, at $0.50 per million input tokens and $2.50 per million output tokens, with a faster variant available for $3.00 and $15.00, respectively. Cursor is also developing a larger successor model "from scratch" with SpaceX and xAI, utilizing ten times the compute on the Colossus-2 cluster.
Key takeaway
For AI architects and engineering leaders evaluating coding assistants, Composer 2.5 offers a compelling alternative to higher-priced models like Opus 4.7 and GPT-5.5. Its comparable benchmark performance at significantly lower token costs could optimize your development budget without sacrificing code quality. Consider integrating Composer 2.5 into your workflows to reduce operational expenses.
Key insights
Composer 2.5 matches top-tier coding AI performance at a fraction of the cost through extensive synthetic task training.
Principles
- Synthetic data scales model capabilities.
- Cost-efficiency is achievable with competitive performance.
Method
Composer 2.5's development involved building on an open-source checkpoint, training on 25x more synthetic tasks, and dedicating 85% of compute to reinforcement learning.
In practice
- Evaluate Composer 2.5 for cost-sensitive coding tasks.
- Consider synthetic data generation for model improvement.
Topics
- Composer 2.5
- AI Coding Model
- Model Benchmarks
- Token Pricing
- Kimi K2.5
Best for: CTO, VP of Engineering/Data, AI Architect, AI Engineer, Machine Learning Engineer, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by The Decoder.