optimize_anything: A Universal API for Optimizing any Text Parameter

· Source: cs.NE updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering, Robotics & Autonomous Systems · Depth: Expert, extended

Summary

optimize_anything is a new LLM-based optimization system that unifies diverse problem-solving tasks under a single declarative API. It can optimize text artifacts across six fundamentally different domains, including code, prompts, agent architectures, numerical configurations, and images. The system achieves state-of-the-art results, such as nearly tripling Gemini Flash's ARC-AGI accuracy from 32.5% to 89.5%, cutting cloud costs by 40% with discovered scheduling algorithms, and generating CUDA kernels where 87% match or beat PyTorch baselines. It supports single-task, multi-task, and generalization modes, with multi-task search demonstrating cross-problem transfer benefits. A key feature is the use of actionable "Side Information" (SI) from evaluators, which provides diagnostic feedback to the LLM proposer, leading to 4-6x faster convergence and higher final performance compared to score-only feedback. The system is open-sourced as part of the GEPA project.

Key takeaway

For research scientists developing optimization solutions, optimize_anything demonstrates that a single LLM-based framework can outperform specialized tools across diverse domains. You should consider formulating your optimization problems as text artifact improvements, leveraging rich diagnostic feedback (Side Information) to guide LLM proposers, and exploring multi-task search for related problems to achieve faster convergence and superior results compared to single-task approaches.

Key insights

A unified LLM-based system optimizes diverse text artifacts across domains using diagnostic feedback and multi-task search.

Principles

Method

The system takes a seed artifact and an evaluator returning a score and optional diagnostic feedback. An LLM proposer iteratively refines the artifact based on this feedback, using Pareto-based search across three optimization modes.

In practice

Topics

Code references

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.NE updates on arXiv.org.