CLAIDec 19, 2024

Dehallucinating Parallel Context Extension for Retrieval-Augmented Generation

arXiv:2412.14905v1h-index: 16
Originality Incremental advance
AI Analysis

This addresses hallucination issues in RAG for users of large language models, but it is incremental as it builds on existing parallel context extension methods.

The paper tackles the problem of hallucinations in retrieval-augmented generation when using parallel context extension, proposing DePaC to reduce fact fabrication and omission, with experimental results showing significant alleviation of hallucinations and better performance on nine RAG tasks.

Large language models (LLMs) are susceptible to generating hallucinated information, despite the integration of retrieval-augmented generation (RAG). Parallel context extension (PCE) is a line of research attempting to effectively integrating parallel (unordered) contexts, while it still suffers from hallucinations when adapted to RAG scenarios. In this paper, we propose DePaC (Dehallucinating Parallel Context Extension), which alleviates the hallucination problem with context-aware negative training and information-calibrated aggregation. DePaC is designed to alleviate two types of in-context hallucination: fact fabrication (i.e., LLMs present claims that are not supported by the contexts) and fact omission (i.e., LLMs fail to present claims that can be supported by the contexts). Specifically, (1) for fact fabrication, we apply the context-aware negative training that fine-tunes the LLMs with negative supervisions, thus explicitly guiding the LLMs to refuse to answer when contexts are not related to questions; (2) for fact omission, we propose the information-calibrated aggregation which prioritizes context windows with higher information increment from their contexts. The experimental results on nine RAG tasks demonstrate that DePaC significantly alleviates the two types of hallucination and consistently achieves better performances on these tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes