Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

· Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Expert, extended

Summary

Trace2Skill is a novel framework designed to automatically create and adapt domain-specific skills for Large Language Model (LLM) agents, addressing the scalability issues of manual authoring and limitations of sequential skill generation. It operates by deploying a parallel fleet of sub-agents to analyze diverse execution trajectories, extracting lessons, and then hierarchically consolidating them into a unified, conflict-free skill directory through inductive reasoning. This approach supports both refining existing human-written skills and generating new ones from scratch. Experiments across challenging domains like spreadsheet, VisionQA, and math reasoning demonstrate that Trace2Skill significantly improves performance over strong baselines, with skills evolved by Qwen3.5-35B enhancing a Qwen3.5-122B agent by up to 57.65 absolute percentage points on WikiTableQuestions. The framework ensures skill transferability across LLM scales and generalization to out-of-distribution tasks, requiring no parameter updates or external retrieval modules, and utilizing open-source models as small as 35B parameters.

Key takeaway

For MLOps Engineers and AI Scientists deploying LLM agents in complex, domain-specific environments, you should consider Trace2Skill's approach to skill evolution. Its parallel analysis and inductive consolidation of agent trajectories offer a robust method to create or deepen skills, outperforming sequential updates and retrieval-based systems. This framework enables highly transferable, declarative skills across LLM scales and out-of-distribution tasks, reducing dependency on manual authoring or proprietary models, and improving agent performance without parameter updates.

Key insights

Trace2Skill distills broad agent experience into transferable, declarative skills via parallel analysis and inductive consolidation.

Principles

Method

Trace2Skill generates trajectories, dispatches parallel sub-agents for patch proposals from successes/failures, then hierarchically consolidates these patches via inductive reasoning and programmatic conflict prevention into a unified skill.

In practice

Topics

Best for: Research Scientist, AI Architect, NLP Engineer, AI Scientist, AI Engineer, MLOps Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.