OptSkills: Learning Generalizable Optimization Skills from Problem Archetypes via Cluster-Based Distillation

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

OptSkills is an archetype-centric skill learning and reasoning agent system designed to enhance the generalization capabilities of Large Language Models (LLMs) in automated optimization. Addressing limitations of current methods, which struggle with narrative variations and adapting to new problem types, OptSkills employs a novel approach. It clusters optimization problems by their underlying archetypes rather than surface narratives to improve robust generalization. For in-distribution generalization, the system explores diverse modeling paradigms and solver configurations within each cluster, distilling successful trajectories into reusable workflow-level skills. Out-of-distribution generalization is achieved by refining existing skills or expanding the skill library with newly obtained trajectories. OptSkills achieves a state-of-the-art micro-averaged accuracy of 68.27% on diverse datasets. Furthermore, it reaches 26.91% accuracy on the challenging MIPLIB-NL benchmark, surpassing DeepSeek-V3.2-Thinking by 4.53%, and 72.79% on the OOD NLCO benchmark after learning on Nano-CO.

Key takeaway

For Machine Learning Engineers developing automated optimization systems with Large Language Models, OptSkills presents a critical advancement. Its archetype-centric skill learning significantly improves generalization, moving beyond superficial narrative variations. You should explore integrating similar cluster-based distillation and skill refinement techniques to enhance your system's robustness and adaptability to new problem types. This approach can lead to higher accuracy and more reliable performance across diverse optimization challenges.

Key insights

OptSkills learns generalizable optimization skills by clustering problem archetypes and distilling successful workflow trajectories.

Principles

Method

OptSkills clusters problems by archetypes, explores diverse modeling and solver configurations, then distills successful trajectories into reusable workflow-level skills, refining them with new data.

In practice

Topics

Code references

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.