CL AI IRDec 17, 2024

SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented Generation

Yuzheng Cai, Zhenyue Guo, Yiwen Pei, Wanrui Bian, Weiguo Zheng

arXiv:2412.15272v26.111 citationsh-index: 5Has CodeACL

Originality Incremental advance

AI Analysis

This addresses the challenge of reducing hallucinations in large language models for applications like question answering and fact verification, representing an incremental improvement over existing knowledge graph-driven retrieval-augmented generation methods.

The paper tackles the problem of aligning query texts with knowledge graph structures in retrieval-augmented generation by proposing SimGRAG, a two-stage method that transforms queries into graph patterns and retrieves similar subgraphs using a graph semantic distance metric, achieving state-of-the-art performance in question answering and fact verification with retrieval within 1 second on a 10-million-scale knowledge graph.

Recent advancements in large language models (LLMs) have shown impressive versatility across various tasks. To eliminate their hallucinations, retrieval-augmented generation (RAG) has emerged as a powerful approach, leveraging external knowledge sources like knowledge graphs (KGs). In this paper, we study the task of KG-driven RAG and propose a novel Similar Graph Enhanced Retrieval-Augmented Generation (SimGRAG) method. It effectively addresses the challenge of aligning query texts and KG structures through a two-stage process: (1) query-to-pattern, which uses an LLM to transform queries into a desired graph pattern, and (2) pattern-to-subgraph, which quantifies the alignment between the pattern and candidate subgraphs using a graph semantic distance (GSD) metric. We also develop an optimized retrieval algorithm that efficiently identifies the top-k subgraphs within 1-second on a 10-million-scale KG. Extensive experiments show that SimGRAG outperforms state-of-the-art KG-driven RAG methods in both question answering and fact verification. Our code is available at https://github.com/YZ-Cai/SimGRAG.

View on arXiv PDF Code

Similar