CL IRAug 20, 2024

Analysis of Plan-based Retrieval for Grounded Text Generation

Ameya Godbole, Nicholas Monath, Seungyeon Kim, Ankit Singh Rawat, Andrew McCallum, Manzil Zaheer

arXiv:2408.10490v114.125 citationsh-index: 42

Originality Incremental advance

AI Analysis

This addresses hallucinations in language models for long-form text generation, but it is incremental as it builds on existing retrieval methods.

The paper tackles the problem of hallucinations in text generation by using plan-guided retrieval to improve fact coverage, resulting in more informative responses and a higher attribution rate to source documents.

In text generation, hallucinations refer to the generation of seemingly coherent text that contradicts established knowledge. One compelling hypothesis is that hallucinations occur when a language model is given a generation task outside its parametric knowledge (due to rarity, recency, domain, etc.). A common strategy to address this limitation is to infuse the language models with retrieval mechanisms, providing the model with relevant knowledge for the task. In this paper, we leverage the planning capabilities of instruction-tuned LLMs and analyze how planning can be used to guide retrieval to further reduce the frequency of hallucinations. We empirically evaluate several variations of our proposed approach on long-form text generation tasks. By improving the coverage of relevant facts, plan-guided retrieval and generation can produce more informative responses while providing a higher rate of attribution to source documents.

View on arXiv PDF

Similar