CLIRAug 20, 2024

Analysis of Plan-based Retrieval for Grounded Text Generation

arXiv:2408.10490v125 citationsh-index: 42
Originality Incremental advance
AI Analysis

This addresses hallucinations in language models for long-form text generation, but it is incremental as it builds on existing retrieval methods.

The paper tackles the problem of hallucinations in text generation by using plan-guided retrieval to improve fact coverage, resulting in more informative responses and a higher attribution rate to source documents.

In text generation, hallucinations refer to the generation of seemingly coherent text that contradicts established knowledge. One compelling hypothesis is that hallucinations occur when a language model is given a generation task outside its parametric knowledge (due to rarity, recency, domain, etc.). A common strategy to address this limitation is to infuse the language models with retrieval mechanisms, providing the model with relevant knowledge for the task. In this paper, we leverage the planning capabilities of instruction-tuned LLMs and analyze how planning can be used to guide retrieval to further reduce the frequency of hallucinations. We empirically evaluate several variations of our proposed approach on long-form text generation tasks. By improving the coverage of relevant facts, plan-guided retrieval and generation can produce more informative responses while providing a higher rate of attribution to source documents.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes