IRCLApr 8, 2025

Graph-based Approaches and Functionalities in Retrieval-Augmented Generation: A Comprehensive Survey

arXiv:2504.10499v121 citationsh-index: 9ACM Computing Surveys
Originality Synthesis-oriented
AI Analysis

It offers a novel perspective for researchers in graph learning, database systems, and NLP by organizing existing methods and identifying challenges, though it is incremental as a survey.

This survey tackles the lack of a unified review on the roles of graphs in Retrieval-Augmented Generation (RAG) to address LLM hallucination issues, providing a comprehensive analysis of graph functionalities across database construction, algorithms, pipelines, and tasks.

Large language models (LLMs) struggle with the factual error during inference due to the lack of sufficient training data and the most updated knowledge, leading to the hallucination problem. Retrieval-Augmented Generation (RAG) has gained attention as a promising solution to address the limitation of LLMs, by retrieving relevant information from external source to generate more accurate answers to the questions. Given the pervasive presence of structured knowledge in the external source, considerable strides in RAG have been made to employ the techniques related to graphs and achieve more complex reasoning based on the topological information between knowledge entities. However, there is currently neither unified review examining the diverse roles of graphs in RAG, nor a comprehensive resource to help researchers navigate and contribute to this evolving field. This survey offers a novel perspective on the functionality of graphs within RAG and their impact on enhancing performance across a wide range of graph-structured data. It provides a detailed breakdown of the roles that graphs play in RAG, covering database construction, algorithms, pipelines, and tasks. Finally, it identifies current challenges and outline future research directions, aiming to inspire further developments in this field. Our graph-centered analysis highlights the commonalities and differences in existing methods, setting the stage for future researchers in areas such as graph learning, database systems, and natural language processing.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes