CLOct 15, 2024

LargePiG: Your Large Language Model is Secretly a Pointer Generator

arXiv:2410.11366v1h-index: 19
Originality Incremental advance
AI Analysis

This addresses hallucination problems in query generation for users of LLMs, representing an incremental improvement by adapting existing pointer-generator concepts to LLMs.

The paper tackles hallucination issues in query generation with Large Language Models by introducing a model-agnostic, training-free method called LargePiG that separates content from form, and it demonstrates superiority in reducing hallucinations and improving accuracy on constructed datasets for document and video scenarios.

Recent research on query generation has focused on using Large Language Models (LLMs), which despite bringing state-of-the-art performance, also introduce issues with hallucinations in the generated queries. In this work, we introduce relevance hallucination and factuality hallucination as a new typology for hallucination problems brought by query generation based on LLMs. We propose an effective way to separate content from form in LLM-generated queries, which preserves the factual knowledge extracted and integrated from the inputs and compiles the syntactic structure, including function words, using the powerful linguistic capabilities of the LLM. Specifically, we introduce a model-agnostic and training-free method that turns the Large Language Model into a Pointer-Generator (LargePiG), where the pointer attention distribution leverages the LLM's inherent attention weights, and the copy probability is derived from the difference between the vocabulary distribution of the model's high layers and the last layer. To validate the effectiveness of LargePiG, we constructed two datasets for assessing the hallucination problems in query generation, covering both document and video scenarios. Empirical studies on various LLMs demonstrated the superiority of LargePiG on both datasets. Additional experiments also verified that LargePiG could reduce hallucination in large vision language models and improve the accuracy of document-based question-answering and factuality evaluation tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes