AIApr 7

Graph of Skills: Dependency-Aware Structural Retrieval for Massive Agent Skills

arXiv:2604.0533369.713 citations
AI Analysis

This addresses efficiency and performance issues for developers and users of large-scale agent systems, though it is incremental as it builds on existing retrieval and skill-based methods.

The paper tackles the problem of scaling agent skill libraries to thousands of skills, which saturates context windows and increases costs, by introducing Graph of Skills (GoS), a structural retrieval layer that improves average reward by 43.6% and reduces input tokens by 37.8% compared to full skill-loading baselines.

Skill usage has become a core component of modern agent systems and can substantially improve agents' ability to complete complex tasks. In real-world settings, where agents must monitor and interact with numerous personal applications, web browsers, and other environment interfaces, skill libraries can scale to thousands of reusable skills. Scaling to larger skill sets introduces two key challenges. First, loading the full skill set saturates the context window, driving up token costs, hallucination, and latency. In this paper, we present Graph of Skills (GoS), an inference-time structural retrieval layer for large skill libraries. GoS constructs an executable skill graph offline from skill packages, then at inference time retrieves a bounded, dependency-aware skill bundle through hybrid semantic-lexical seeding, reverse-weighted Personalized PageRank, and context-budgeted hydration. On SkillsBench and ALFWorld, GoS improves average reward by 43.6% over the vanilla full skill-loading baseline while reducing input tokens by 37.8%, and generalizes across three model families: Claude Sonnet, GPT-5.2 Codex, and MiniMax. Additional ablation studies across skill libraries ranging from 200 to 2,000 skills further demonstrate that GoS consistently outperforms both vanilla skills loading and simple vector retrieval in balancing reward, token efficiency, and runtime.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes