From Multi-Agent to Single-Agent: When Is Skill Distillation Beneficial?
Summary
Researchers from The Chinese University of Hong Kong and LIGHTSPEED introduce "Metric Freedom" ($F$), an a priori predictor for determining when distilling a Multi-Agent System (MAS) into a single-agent skill is beneficial. The study reveals that skill utility is governed by the evaluation metric's topological rigidity, not the task itself, with empirical outcomes showing skill lift ranging from a 28% improvement to a 2% degradation for the same task. They propose a two-stage adaptive distillation framework: Stage 1 selectively extracts tools and knowledge, discarding restrictive structures for "free" metrics ($F\approx 1$) while preserving exploration; Stage 2 targets computationally intensive iterative refinement exclusively toward "rigid" metrics ($F\lesssim 0.6$) to prevent trajectory-local overfitting. Evaluating across 4 tasks, 11 datasets, and 6 metrics, $F$ strongly predicts skill utility ($\rho{=}{-}0.62$, $p{<}0.05$). This adaptive agent matches or exceeds original MAS performance while reducing inference cost by 1.4-8\times and latency by up to 15\times.
Key takeaway
For research scientists evaluating or implementing multi-agent system (MAS) to single-agent skill distillation, you should first calculate the Metric Freedom ($F$) of your evaluation metric. This will predict whether skill augmentation will be beneficial or detrimental, guiding your distillation strategy to either preserve exploration for "free" metrics ($F\approx 1$) or apply structured guidance and iterative refinement for "rigid" metrics ($F\lesssim 0.6$), thereby optimizing performance and significantly reducing cost and latency.
Key insights
Skill distillation utility is determined by evaluation metric rigidity, not task complexity, quantified by Metric Freedom ($F$).
Principles
- Skill utility is a metric-level property.
- Structure helps rigid metrics, hurts free metrics.
- Tools extend capability without constraining reasoning.
Method
A two-stage adaptive distillation framework uses Metric Freedom ($F$) to selectively extract MAS components and guide iterative refinement, targeting rigid metrics ($F\lesssim 0.6$) for optimization.
In practice
- Compute Metric Freedom ($F$) from baseline runs.
- Discard MAS coordination structures for high-$F$ metrics.
- Refine skills iteratively only for low-$F$ metrics.
Topics
- Metric Freedom
- Skill Distillation
- Multi-Agent Systems
- Single-Agent LLMs
- Adaptive Distillation Framework
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.