CLFeb 20

Detecting Contextual Hallucinations in LLMs with Frequency-Aware Attention

Siya Qi, Yudong Chen, Runcong Zhao, Qinglin Zhu, Zhanghao Hu, Wei Liu, Yulan He, Zheng Yuan, Lin Gui

arXiv:2602.18145v11.11 citationsh-index: 4

Originality Incremental advance

AI Analysis

This addresses the reliability issue for users of LLMs in context-based generation, though it is incremental as it builds on prior attention-based methods.

The paper tackled the problem of detecting hallucinations in large language models by analyzing attention variations, finding that hallucinated tokens correlate with high-frequency attention energy, and developed a detector that outperformed existing methods on benchmarks like RAGTruth and HalluRAG.

Hallucination detection is critical for ensuring the reliability of large language models (LLMs) in context-based generation. Prior work has explored intrinsic signals available during generation, among which attention offers a direct view of grounding behavior. However, existing approaches typically rely on coarse summaries that fail to capture fine-grained instabilities in attention. Inspired by signal processing, we introduce a frequency-aware perspective on attention by analyzing its variation during generation. We model attention distributions as discrete signals and extract high-frequency components that reflect rapid local changes in attention. Our analysis reveals that hallucinated tokens are associated with high-frequency attention energy, reflecting fragmented and unstable grounding behavior. Based on this insight, we develop a lightweight hallucination detector using high-frequency attention features. Experiments on the RAGTruth and HalluRAG benchmarks show that our approach achieves performance gains over verification-based, internal-representation-based, and attention-based methods across models and tasks.

View on arXiv PDF

Similar