CLLGMar 11, 2025

Gradient-guided Attention Map Editing: Towards Efficient Contextual Hallucination Mitigation

arXiv:2503.08963v213 citationsh-index: 6Has CodeNAACL
Originality Incremental advance
AI Analysis

This addresses the problem of unreliable outputs in LLMs for users of summarization and QA systems, though it appears incremental as it builds on existing attention mechanisms.

The paper tackles contextual hallucination in LLMs during summarization and QA tasks, where models generate irrelevant responses despite having correct source information, by introducing Gradient-guided Attention Map Editing (GAME) to dynamically adjust attention maps. The method reduces hallucinations by 10% on XSum summarization while achieving a 7x speed-up in computational efficiency compared to baselines.

In tasks like summarization and open-book question answering (QA), Large Language Models (LLMs) often encounter "contextual hallucination", where they produce irrelevant or incorrect responses despite having access to accurate source information. This typically occurs because these models tend to prioritize self-generated content over the input context, causing them to disregard pertinent details. To address this challenge, we introduce a novel method called "Guided Attention Map Editing" (GAME), which dynamically adjusts attention maps to improve contextual relevance. During inference, GAME employs a trained classifier to identify attention maps prone to inducing hallucinations and executes targeted interventions. These interventions, guided by gradient-informed "edit directions'', strategically redistribute attention weights across various heads to effectively reduce hallucination. Comprehensive evaluations on challenging summarization and open-book QA tasks show that GAME consistently reduces hallucinations across a variety of open-source models. Specifically, GAME reduces hallucinations by 10% in the XSum summarization task while achieving a 7X speed-up in computational efficiency compared to the state-of-the-art baselines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes