CL AIDec 9, 2025

Detecting Hallucinations in Graph Retrieval-Augmented Generation via Attention Patterns and Semantic Alignment

Shanghao Li, Jinda Han, Yibo Wang, Yuanjie Zhu, Zihe Song, Langzhou He, Kenan Kamel A Alghythee, Philip S. Yu

arXiv:2512.09148v12.71 citationsh-index: 6

Originality Incremental advance

AI Analysis

This work addresses hallucinations in GraphRAG systems for knowledge-based QA, offering incremental improvements through interpretability metrics and a detector to enhance reliability.

The paper tackled the problem of hallucinations in Graph Retrieval-Augmented Generation (GraphRAG) by analyzing how LLMs attend to structured knowledge, identifying failure patterns like over-reliance on shortest paths and weak semantic grounding, and developing a lightweight detector (GGA) that outperformed baselines in AUC and F1 scores.

Graph-based Retrieval-Augmented Generation (GraphRAG) enhances Large Language Models (LLMs) by incorporating external knowledge from linearized subgraphs retrieved from knowledge graphs. However, LLMs struggle to interpret the relational and topological information in these inputs, resulting in hallucinations that are inconsistent with the retrieved knowledge. To analyze how LLMs attend to and retain structured knowledge during generation, we propose two lightweight interpretability metrics: Path Reliance Degree (PRD), which measures over-reliance on shortest-path triples, and Semantic Alignment Score (SAS), which assesses how well the model's internal representations align with the retrieved knowledge. Through empirical analysis on a knowledge-based QA task, we identify failure patterns associated with over-reliance on salient paths and weak semantic grounding, as indicated by high PRD and low SAS scores. We further develop a lightweight post-hoc hallucination detector, Graph Grounding and Alignment (GGA), which outperforms strong semantic and confidence-based baselines across AUC and F1. By grounding hallucination analysis in mechanistic interpretability, our work offers insights into how structural limitations in LLMs contribute to hallucinations, informing the design of more reliable GraphRAG systems in the future.

View on arXiv PDF

Similar