CLAILGJun 20, 2024

Large Language Models are Skeptics: False Negative Problem of Input-conflicting Hallucination

arXiv:2406.13929v17 citations
Originality Incremental advance
AI Analysis

This addresses a specific hallucination issue in LLMs that affects reliability in applications like fact-checking, though it is incremental as it focuses on one type of bias.

The paper identifies a new bias called the false negative problem in large language models (LLMs), where they tend to generate input-conflicting hallucinations by returning negative judgments when assessing statement correctness, with experiments showing greater overconfidence in false responses. It finds that context and query rewriting effectively reduce this bias.

In this paper, we identify a new category of bias that induces input-conflicting hallucinations, where large language models (LLMs) generate responses inconsistent with the content of the input context. This issue we have termed the false negative problem refers to the phenomenon where LLMs are predisposed to return negative judgments when assessing the correctness of a statement given the context. In experiments involving pairs of statements that contain the same information but have contradictory factual directions, we observe that LLMs exhibit a bias toward false negatives. Specifically, the model presents greater overconfidence when responding with False. Furthermore, we analyze the relationship between the false negative problem and context and query rewriting and observe that both effectively tackle false negatives in LLMs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes