M2.7 just BROKE the Entire Industry...

· Source: Wes Roth · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Software Development & Engineering · Depth: Advanced, extended

Summary

Miniax, a Chinese company founded in 2022 with investments from Alibaba and Tencent, has released M2.7, an AI model demonstrating "self-evolution." This model autonomously improves its own "harness"—the data pipelines, training environments, and tools it uses—through a three-step process. Initially, an early M2.7 checkpoint built an internal research agent harness, handling 30-50% of the reinforcement learning team's workflow. Subsequently, M2.7 recursively improved this harness by tracking its performance, collecting feedback, and iterating on its architecture and skills. The final step involved autonomous scaffold optimization, where M2.7 ran over 100 rounds of hypothesis generation, experiment design, code modification, and benchmark testing with zero human input. This process led to a 30% improvement on internal benchmarks and a score of 66.6 on OpenAI's MLE Bench, tying with Gemini 3.1 and nearing the performance of top-tier models like Opus 4.6 and GPT 5.4, all while running on a single A30 GPU. Miniax is also using M2.7 to restructure its operations into an "AI native organization" and has launched Open Room, an open-source AI agent with a human-like personality.

Key takeaway

For CTOs and entrepreneurs evaluating AI integration, Miniax's M2.7 demonstrates that advanced self-evolving AI, capable of autonomous research and operational optimization, is achievable on accessible hardware. You should consider how such AI-native approaches could fundamentally restructure your organization's workflows and accelerate product development, potentially leading to "zero-person" company models or highly augmented teams. Explore open-source initiatives like Open Room to understand practical implementations of AI agents with integrated personalities.

Key insights

Miniax's M2.7 model demonstrates autonomous self-improvement, achieving significant performance gains and nearing top-tier AI models on consumer-grade hardware.

Principles

Method

M2.7 builds its own operational harness, recursively improves it by tracking performance and iterating on architecture, then autonomously optimizes its scaffold through iterative hypothesis testing and experimentation without human intervention.

In practice

Topics

Best for: Investor, Entrepreneur, CTO, AI Engineer, Machine Learning Engineer, AI Researcher

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Wes Roth.