CL AIAug 28, 2024

LRP4RAG: Detecting Hallucinations in Retrieval-Augmented Generation via Layer-wise Relevance Propagation

Haichuan Hu, Congqing He, Xiaochen Xie, Quanjun Zhang

arXiv:2408.15533v39.119 citationsh-index: 5Has Code

Originality Incremental advance

AI Analysis

This addresses hallucinations in RAG for improving reliability in applications like question-answering, but it is incremental as it adapts an existing technique (LRP) to a new task.

The paper tackles the problem of persistent hallucinations in Retrieval-Augmented Generation (RAG) systems by proposing LRP4RAG, a method that uses Layer-wise Relevance Propagation to detect hallucinations, and it outperforms existing baselines in experiments.

Retrieval-Augmented Generation (RAG) has become a primary technique for mitigating hallucinations in large language models (LLMs). However, incomplete knowledge extraction and insufficient understanding can still mislead LLMs to produce irrelevant or even contradictory responses, which means hallucinations persist in RAG. In this paper, we propose LRP4RAG, a method based on the Layer-wise Relevance Propagation (LRP) algorithm for detecting hallucinations in RAG. Specifically, we first utilize LRP to compute the relevance between the input and output of the RAG generator. We then apply further extraction and resampling to the relevance matrix. The processed relevance data are input into multiple classifiers to determine whether the output contains hallucinations. To the best of our knowledge, this is the first time that LRP has been used for detecting RAG hallucinations, and extensive experiments demonstrate that LRP4RAG outperforms existing baselines.

View on arXiv PDF Code

Similar