M2.7 just BROKE the Entire Industry...
Summary
Miniax, a Chinese company founded in 2022 with investments from Alibaba and Tencent, has released M2.7, an AI model demonstrating "self-evolution." This model autonomously improves its own "harness"—the data pipelines, training environments, and tools it uses—through a three-step process. Initially, an early M2.7 checkpoint built an internal research agent harness, handling 30-50% of the reinforcement learning team's workflow. Subsequently, M2.7 recursively improved this harness by tracking its performance, collecting feedback, and iterating on its architecture and skills. The final step involved autonomous scaffold optimization, where M2.7 ran over 100 rounds of hypothesis generation, experiment design, code modification, and benchmark testing with zero human input. This process led to a 30% improvement on internal benchmarks and a score of 66.6 on OpenAI's MLE Bench, tying with Gemini 3.1 and nearing the performance of top-tier models like Opus 4.6 and GPT 5.4, all while running on a single A30 GPU. Miniax is also using M2.7 to restructure its operations into an "AI native organization" and has launched Open Room, an open-source AI agent with a human-like personality.
Key takeaway
For CTOs and entrepreneurs evaluating AI integration, Miniax's M2.7 demonstrates that advanced self-evolving AI, capable of autonomous research and operational optimization, is achievable on accessible hardware. You should consider how such AI-native approaches could fundamentally restructure your organization's workflows and accelerate product development, potentially leading to "zero-person" company models or highly augmented teams. Explore open-source initiatives like Open Room to understand practical implementations of AI agents with integrated personalities.
Key insights
Miniax's M2.7 model demonstrates autonomous self-improvement, achieving significant performance gains and nearing top-tier AI models on consumer-grade hardware.
Principles
- AI models can autonomously improve their operational frameworks.
- Recursive self-improvement can significantly boost model performance.
- Personality enhances user interaction with AI agents.
Method
M2.7 builds its own operational harness, recursively improves it by tracking performance and iterating on architecture, then autonomously optimizes its scaffold through iterative hypothesis testing and experimentation without human intervention.
In practice
- Utilize AI agents for end-to-end ML workflow automation.
- Explore self-improving AI for business process optimization.
- Consider AI agent personality for enhanced user engagement.
Topics
- Self-Evolution
- AI Agents
- MLE Benchmarks
- Model Optimization
- AI Native Organizations
Best for: Investor, Entrepreneur, CTO, AI Engineer, Machine Learning Engineer, AI Researcher
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Wes Roth.