EvoForest: A Novel Machine-Learning Paradigm via Open-Ended Evolution of Computational Graphs
Summary
EvoForest is a novel hybrid neuro-symbolic system designed for open-ended evolution of computational graphs in machine learning, particularly for structured prediction problems where discovering appropriate computations is critical. Unlike traditional methods focused on parameter optimization, EvoForest jointly evolves reusable computational structures, callable function families, and trainable low-dimensional continuous components within a shared directed acyclic graph (DAG). It directly evaluates candidate graph configurations against non-differentiable cross-validation targets, using a lightweight Ridge-based readout. The system incorporates a diagnostic feedback loop, converting evaluator internals into structured feedback that guides future LLM-driven mutation proposals. In the 2025 ADIA Lab Structural Break Challenge, EvoForest achieved a 94.13% ROC-AUC after 600 evolution steps, surpassing the publicly reported winning score of 90.14% under the same protocol. This framework exemplifies a "search-first" paradigm, emphasizing explicit discovery, reuse, refinement, and selection of useful computations.
Key takeaway
For AI Engineers and Research Scientists tackling complex structured prediction problems with non-differentiable objectives, EvoForest offers a compelling alternative to traditional parameter-centric ML. Its "search-first" approach, which explicitly discovers and refines computational structures, can yield superior performance and even transferable algorithmic knowledge, as demonstrated by the Multi-Space Random Features (MSRF) discovery. You should explore adopting evolutionary computation frameworks that integrate diagnostic feedback and LLM-guided search to uncover novel, high-performing computational motifs.
Key insights
EvoForest evolves computational graphs and functions, guided by diagnostics, to discover optimal computations for structured prediction.
Principles
- Learning as computation search, not just parameter fitting.
- Diagnostic feedback drives LLM-guided evolutionary mutations.
- Multi-alternative DAGs enable combinatorial search and reuse.
Method
EvoForest uses an asynchronous island model with LLM-guided "scientist" and "engineer" stages to propose and synthesize graph mutations, refining continuous parameters via gradient descent and scoring configurations with a Ridge model.
In practice
- Apply EvoForest for non-differentiable, cross-validation-based objectives.
- Consider multi-space feature allocation for time-series problems.
- Use diagnostic feedback to guide feature engineering.
Topics
- EvoForest
- Computational Graph Evolution
- Neuro-Symbolic AI
- Search-First Machine Learning
- ADIA Lab Structural Break Challenge
Code references
Best for: AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.NE updates on arXiv.org.