MLEvolve: A Self-Evolving Framework for Automated Machine Learning Algorithm Discovery

2026-06-06 · Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Expert, long

Summary

MLEvolve is an LLM-based, self-evolving multi-agent framework designed for automated machine learning algorithm discovery. It addresses limitations in existing MLE agents, such as inter-branch information isolation, memoryless search, and lack of hierarchical control. MLEvolve integrates Progressive Monte Carlo Graph Search (MCGS) for cross-branch information flow and adaptive exploration, Retrospective Memory for accumulating and reusing task-specific experience, and Hierarchical Planning with Adaptive Code Generation to decouple strategy from implementation. Evaluated on MLE-Bench, MLEvolve achieved a 65.3% average medal rate under a 12-hour budget, outperforming proprietary and open-source baselines, including AlphaEvolve, on 75 end-to-end machine learning tasks and 15 mathematical optimization tasks.

Key takeaway

For machine learning engineers developing autonomous agent systems, MLEvolve demonstrates a robust approach to overcoming common limitations in long-horizon algorithm discovery. You should consider integrating graph-based search, dynamic memory mechanisms, and hierarchical planning with adaptive code generation to enhance agent self-evolution, improve solution quality, and manage complex, iterative ML pipeline optimization tasks more effectively.

Key insights

MLEvolve enables LLM agents to self-evolve for long-horizon ML algorithm discovery via graph search, memory, and adaptive code generation.

Principles

Cross-branch information flow improves search efficiency.
Accumulated experience guides future planning.
Decoupling planning from coding enhances control.

Method

MLEvolve uses Progressive MCGS with graph-based expansion and an entropy-inspired schedule, Retrospective Memory for experience, and Hierarchical Planning with adaptive code generation modes (Base, Stepwise, Diff).

In practice

Implement graph-based search for complex problem spaces.
Integrate dynamic memory for iterative task learning.
Use adaptive code generation for controlled modifications.

Topics

LLM Agents
Automated Machine Learning
Monte Carlo Tree Search
Graph-based Search
Retrospective Memory
Code Generation

Code references

InternScience/MLEvolve

Best for: AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

See Counsel's argued verdicts on the open AI decisions leaders are weighing →

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.