Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills
Summary
Trace2Skill is a novel framework designed to automatically create and adapt domain-specific skills for Large Language Model (LLM) agents, addressing the scalability issues of manual authoring and limitations of sequential skill generation. It operates by deploying a parallel fleet of sub-agents to analyze diverse execution trajectories, extracting lessons, and then hierarchically consolidating them into a unified, conflict-free skill directory through inductive reasoning. This approach supports both refining existing human-written skills and generating new ones from scratch. Experiments across challenging domains like spreadsheet, VisionQA, and math reasoning demonstrate that Trace2Skill significantly improves performance over strong baselines, with skills evolved by Qwen3.5-35B enhancing a Qwen3.5-122B agent by up to 57.65 absolute percentage points on WikiTableQuestions. The framework ensures skill transferability across LLM scales and generalization to out-of-distribution tasks, requiring no parameter updates or external retrieval modules, and utilizing open-source models as small as 35B parameters.
Key takeaway
For MLOps Engineers and AI Scientists deploying LLM agents in complex, domain-specific environments, you should consider Trace2Skill's approach to skill evolution. Its parallel analysis and inductive consolidation of agent trajectories offer a robust method to create or deepen skills, outperforming sequential updates and retrieval-based systems. This framework enables highly transferable, declarative skills across LLM scales and out-of-distribution tasks, reducing dependency on manual authoring or proprietary models, and improving agent performance without parameter updates.
Key insights
Trace2Skill distills broad agent experience into transferable, declarative skills via parallel analysis and inductive consolidation.
Principles
- Holistic analysis of diverse trajectories yields generalizable skills.
- Parallel processing of experience prevents sequential drift and improves efficiency.
- Declarative skills offer better transferability than episodic retrieval.
Method
Trace2Skill generates trajectories, dispatches parallel sub-agents for patch proposals from successes/failures, then hierarchically consolidates these patches via inductive reasoning and programmatic conflict prevention into a unified skill.
In practice
- Deepen existing human-written skills for specific LLM agents.
- Create new, effective skills from scratch using parametric knowledge.
- Improve agent performance in spreadsheet, math, and VQA tasks.
Topics
- LLM Agents
- Skill Evolution
- Automated Skill Generation
- Trajectory Analysis
- Inductive Reasoning
- Qwen3.5
Best for: Research Scientist, AI Architect, NLP Engineer, AI Scientist, AI Engineer, MLOps Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.