SkillsVote: Lifecycle Governance of Agent Skills from Collection, Recommendation to Evolution

· Source: Computation and Language · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Expert, quick

Summary

SkillsVote is a lifecycle-governance framework designed for managing Agent Skills, which are defined as an experience schema combining executable scripts with non-executable procedural guidance. The framework addresses challenges in open skill ecosystems, such as redundancy, uneven quality, and environment sensitivity, which can lead to context pollution. SkillsVote profiles a million-scale open-source corpus to assess environment requirements, quality, and verifiability, then synthesizes tasks for verifiable skills. It performs agentic library searches to provide instructional skill context before execution. After execution, SkillsVote decomposes trajectories into skill-linked subtasks, attributes outcomes, and only admits successful, reusable discoveries for evidence-gated updates. Evaluations show offline evolution improves GPT-5.2 on Terminal-Bench 2.0 by up to 7.9 percentage points, and online evolution improves SWE-Bench Pro by up to 2.6 percentage points.

Key takeaway

For research scientists developing long-horizon LLM agents, implementing a governance framework like SkillsVote can significantly enhance agent performance and reusability. You should focus on structuring skill libraries, verifying skill quality, and establishing clear mechanisms for attributing outcomes and updating skills based on successful execution to prevent context pollution and improve agent capabilities without needing model updates.

Key insights

SkillsVote governs agent skill lifecycles, improving LLM agent performance through structured collection, recommendation, and evolution.

Principles

Method

SkillsVote profiles skill corpora, synthesizes verifiable tasks, performs agentic library search, decomposes trajectories, attributes outcomes, and gates updates based on successful, reusable discoveries.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.