AIMar 2

GraphScout: Empowering Large Language Models with Intrinsic Exploration Ability for Agentic Graph Reasoning

Yuchen Ying, Weiqi Jiang, Tongya Zheng, Yu Wang, Shunyu Liu, Kaixuan Chen, Mingli Song

arXiv:2603.01410v12.41 citationsh-index: 17Has Code

Originality Highly original

AI Analysis

This addresses the problem of constrained graph exploration in agentic reasoning for applications using knowledge graphs and LLMs, representing a novel method rather than an incremental improvement.

The paper tackles the limitation of existing GraphRAG methods that rely on manual guidance and predefined tools for graph exploration by proposing GraphScout, a training-centric framework that enables LLMs to autonomously interact with knowledge graphs to synthesize training data, resulting in a small model (e.g., Qwen3-4B) outperforming baseline methods by an average of 16.7% across five domains with fewer inference tokens.

Knowledge graphs provide structured and reliable information for many real-world applications, motivating increasing interest in combining large language models (LLMs) with graph-based retrieval to improve factual grounding. Recent Graph-based Retrieval-Augmented Generation (GraphRAG) methods therefore introduce iterative interaction between LLMs and knowledge graphs to enhance reasoning capability. However, existing approaches typically depend on manually designed guidance and interact with knowledge graphs through a limited set of predefined tools, which substantially constrains graph exploration. To address these limitations, we propose GraphScout, a training-centric agentic graph reasoning framework equipped with more flexible graph exploration tools. GraphScout enables models to autonomously interact with knowledge graphs to synthesize structured training data which are then used to post-train LLMs, thereby internalizing agentic graph reasoning ability without laborious manual annotation or task curation. Extensive experiments across five knowledge-graph domains show that a small model (e.g., Qwen3-4B) augmented with GraphScout outperforms baseline methods built on leading LLMs (e.g., Qwen-Max) by an average of 16.7\% while requiring significantly fewer inference tokens. Moreover, GraphScout exhibits robust cross-domain transfer performance. Our code will be made publicly available~\footnote{https://github.com/Ying-Yuchen/_GraphScout_}.

View on arXiv PDF Code

Similar