CL AIMay 21, 2025

RePPL: Recalibrating Perplexity by Uncertainty in Semantic Propagation and Language Generation for Explainable QA Hallucination Detection

Yiming Huang, Junyan Zhang, Zihao Wang, Biquan Bie, Yunzhong Qiu, Yi R. Fung, Xinlei He

arXiv:2505.15386v210.93 citationsh-index: 5SIGGRAPH

Originality Incremental advance

AI Analysis

This addresses the issue of untrustworthy LLM outputs for users needing explainable AI, though it is incremental as it builds on existing uncertainty-based detection methods.

The paper tackles the problem of hallucination detection in large language models by proposing RePPL, a method that recalibrates uncertainty measurement based on semantic propagation and language generation, achieving an average AUC of 0.833 across various QA datasets and providing token-level explanations for hallucinations.

Large Language Models (LLMs) have become powerful, but hallucinations remain a vital obstacle to their trustworthy use. While previous works improved the capability of hallucination detection by measuring uncertainty, they all lack the ability to explain the provenance behind why hallucinations occur, i.e., which part of the inputs tends to trigger hallucinations. Recent works on the prompt attack indicate that uncertainty exists in semantic propagation, where attention mechanisms gradually fuse local token information into high-level semantics across layers. Meanwhile, uncertainty also emerges in language generation, due to its probability-based selection of high-level semantics for sampled generations. Based on that, we propose RePPL to recalibrate uncertainty measurement by these two aspects, which dispatches explainable uncertainty scores to each token and aggregates in Perplexity-style Log-Average form as total score. Experiments show that our method achieves the best comprehensive detection performance across various QA datasets on advanced models (average AUC of 0.833), and our method is capable of producing token-level uncertainty scores as explanations for the hallucination. Leveraging these scores, we preliminarily find the chaotic pattern of hallucination and showcase its promising usage.

View on arXiv PDF

Similar