CLAISep 17, 2024

Zero-resource Hallucination Detection for Text Generation via Graph-based Contextual Knowledge Triples Modeling

arXiv:2409.11283v411 citationsh-index: 12
Originality Incremental advance
AI Analysis

This addresses the challenge of hallucination detection for open-ended text generation in scenarios where external knowledge is hard to access, though it is incremental as it builds on existing consistency-based methods.

The paper tackles the problem of detecting hallucinations in long, open-ended text generation without external resources by proposing a graph-based context-aware method that aligns knowledge triples and models their dependencies, achieving enhanced detection performance and outperforming all baselines in experiments.

LLMs obtain remarkable performance but suffer from hallucinations. Most research on detecting hallucination focuses on the questions with short and concrete correct answers that are easy to check the faithfulness. Hallucination detections for text generation with open-ended answers are more challenging. Some researchers use external knowledge to detect hallucinations in generated texts, but external resources for specific scenarios are hard to access. Recent studies on detecting hallucinations in long text without external resources conduct consistency comparison among multiple sampled outputs. To handle long texts, researchers split long texts into multiple facts and individually compare the consistency of each pairs of facts. However, these methods (1) hardly achieve alignment among multiple facts; (2) overlook dependencies between multiple contextual facts. In this paper, we propose a graph-based context-aware (GCA) hallucination detection for text generations, which aligns knowledge facts and considers the dependencies between contextual knowledge triples in consistency comparison. Particularly, to align multiple facts, we conduct a triple-oriented response segmentation to extract multiple knowledge triples. To model dependencies among contextual knowledge triple (facts), we construct contextual triple into a graph and enhance triples' interactions via message passing and aggregating via RGCN. To avoid the omission of knowledge triples in long text, we conduct a LLM-based reverse verification via reconstructing the knowledge triples. Experiments show that our model enhances hallucination detection and excels all baselines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes