Introducing Composer 2.5
Summary
Cursor has released Composer 2.5, an enhanced version of its AI assistant, now available within the Cursor environment. This iteration significantly improves intelligence and behavior over Composer 2, demonstrating better sustained performance on long tasks, more reliable adherence to complex instructions, and improved collaborative interaction. The advancements stem from scaled training, more complex Reinforcement Learning (RL) environments, and new learning methods. Composer 2.5 is built on Moonshot's Kimi K2.5 open-source checkpoint, similar to its predecessor. Cursor is also collaborating with SpaceXAI to train a substantially larger model from scratch, utilizing 10x more compute and Colossus 2's million H100-equivalents, anticipating a major leap in capability. Composer 2.5 is priced at $0.50/M input and $2.50/M output tokens, with a faster variant at $3.00/M input and $15.00/M output tokens.
Key takeaway
For AI/ML engineering leaders evaluating advanced code assistants, Composer 2.5 offers substantial improvements in task reliability and collaborative behavior, driven by sophisticated training techniques. You should consider its new pricing tiers and the double usage offer for the first week to assess its fit for your development workflows, especially for long-running or complex coding projects. Its underlying training innovations suggest a robust foundation for future model capabilities.
Key insights
Composer 2.5 improves AI assistant performance through targeted RL, synthetic data, and advanced distributed training.
Principles
- Localized feedback improves RL credit assignment.
- Synthetic tasks scale intelligence and expose reward hacking.
- Orthogonalization optimizes distributed MoE training.
Method
Composer 2.5 uses targeted textual feedback for localized RL, dynamically generated synthetic tasks for intelligence scaling, and sharded Muon with dual mesh HSDP for efficient distributed training of MoE models.
In practice
- Apply textual feedback for specific behavior correction.
- Generate synthetic tasks to expand training data.
- Utilize HSDP with separate layouts for MoE models.
Topics
- Composer 2.5
- Reinforcement Learning
- Textual Feedback
- Synthetic Data Generation
- Distributed Training
Best for: CTO, VP of Engineering/Data, Director of AI/ML, AI Scientist, Machine Learning Engineer, AI Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Cursor Blog.