AIApr 15, 2025

NodeRAG: Structuring Graph-based RAG with Heterogeneous Nodes

arXiv:2504.11544v117 citationsh-index: 10Has Code
Originality Incremental advance
AI Analysis

This addresses performance bottlenecks in graph-based RAG for domain-specific applications like question-answering, though it appears incremental relative to existing graph-RAG approaches.

The paper tackles the problem of suboptimal graph structure design in graph-based retrieval-augmented generation (RAG) systems, proposing NodeRAG, a framework with heterogeneous graph structures that improves indexing time by 30%, query time by 25%, and achieves 5-8% higher accuracy on multi-hop benchmarks compared to prior methods.

Retrieval-augmented generation (RAG) empowers large language models to access external and private corpus, enabling factually consistent responses in specific domains. By exploiting the inherent structure of the corpus, graph-based RAG methods further enrich this process by building a knowledge graph index and leveraging the structural nature of graphs. However, current graph-based RAG approaches seldom prioritize the design of graph structures. Inadequately designed graph not only impede the seamless integration of diverse graph algorithms but also result in workflow inconsistencies and degraded performance. To further unleash the potential of graph for RAG, we propose NodeRAG, a graph-centric framework introducing heterogeneous graph structures that enable the seamless and holistic integration of graph-based methodologies into the RAG workflow. By aligning closely with the capabilities of LLMs, this framework ensures a fully cohesive and efficient end-to-end process. Through extensive experiments, we demonstrate that NodeRAG exhibits performance advantages over previous methods, including GraphRAG and LightRAG, not only in indexing time, query time, and storage efficiency but also in delivering superior question-answering performance on multi-hop benchmarks and open-ended head-to-head evaluations with minimal retrieval tokens. Our GitHub repository could be seen at https://github.com/Terry-Xu-666/NodeRAG.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes