optimize_anything: A Universal API for Optimizing any Text Parameter
Summary
optimize_anything is a new LLM-based optimization system that unifies diverse problem-solving tasks under a single declarative API. It can optimize text artifacts across six fundamentally different domains, including code, prompts, agent architectures, numerical configurations, and images. The system achieves state-of-the-art results, such as nearly tripling Gemini Flash's ARC-AGI accuracy from 32.5% to 89.5%, cutting cloud costs by 40% with discovered scheduling algorithms, and generating CUDA kernels where 87% match or beat PyTorch baselines. It supports single-task, multi-task, and generalization modes, with multi-task search demonstrating cross-problem transfer benefits. A key feature is the use of actionable "Side Information" (SI) from evaluators, which provides diagnostic feedback to the LLM proposer, leading to 4-6x faster convergence and higher final performance compared to score-only feedback. The system is open-sourced as part of the GEPA project.
Key takeaway
For research scientists developing optimization solutions, optimize_anything demonstrates that a single LLM-based framework can outperform specialized tools across diverse domains. You should consider formulating your optimization problems as text artifact improvements, leveraging rich diagnostic feedback (Side Information) to guide LLM proposers, and exploring multi-task search for related problems to achieve faster convergence and superior results compared to single-task approaches.
Key insights
A unified LLM-based system optimizes diverse text artifacts across domains using diagnostic feedback and multi-task search.
Principles
- Text optimization is a general-purpose problem-solving paradigm.
- Actionable side information accelerates convergence and improves performance.
- Multi-task search enables cross-problem transfer of optimization patterns.
Method
The system takes a seed artifact and an evaluator returning a score and optional diagnostic feedback. An LLM proposer iteratively refines the artifact based on this feedback, using Pareto-based search across three optimization modes.
In practice
- Formulate optimization problems as improving text artifacts.
- Design evaluators to return rich diagnostic "Side Information" (SI).
- Utilize multi-task search for related problems to leverage cross-transfer.
Topics
- optimize_anything API
- LLM-based Text Optimization
- Side Information
- Multi-task Search
- Agent Architecture Search
Code references
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.NE updates on arXiv.org.