SkillGraph: Graph Foundation Priors for LLM Agent Tool Sequence Recommendation

· Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems · Depth: Expert, quick

Summary

SkillGraph is a novel framework designed to improve tool selection and ordering for LLM agents interacting with large API libraries. It addresses the limitations of semantic similarity methods, which often fail to account for inter-tool data dependencies, leading to negative Kendall-τ scores in structured workflows. SkillGraph introduces a directed weighted execution-transition graph, a "graph foundation prior," derived from 49,831 successful LLM agent trajectories. This prior encodes workflow-precedence regularities. The framework employs a two-stage decoupled approach: GS-Hybrid retrieval for candidate tool selection and a learned pairwise reranker for optimal ordering. On the ToolBench dataset (9,965 test instances, ~16,000 tools), SkillGraph achieved a Set-F1 of 0.271 and a Kendall-τ of 0.096. On API-Bank, it significantly improved Kendall-τ from -0.433 to +0.613, outperforming LLaMA-3.1-8B Stage-2 rerankers under identical Stage-1 inputs.

Key takeaway

For NLP engineers developing LLM agents that interact with extensive API libraries, SkillGraph offers a robust method to enhance tool selection and sequencing accuracy. Your current semantic-only approaches may be insufficient for complex workflows, leading to suboptimal agent performance. Consider integrating graph-based priors derived from successful agent trajectories to capture critical inter-tool dependencies, significantly improving tool ordering and overall agent efficacy.

Key insights

SkillGraph uses a graph-based prior from agent trajectories to improve LLM tool sequence recommendation beyond semantic similarity.

Principles

Method

SkillGraph uses a two-stage decoupled framework: GS-Hybrid retrieval for candidate selection and a learned pairwise reranker for ordering, built on a graph foundation prior.

In practice

Topics

Best for: NLP Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, AI Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.