CLLGMay 11, 2023

Active Retrieval Augmented Generation

arXiv:2305.06983v2669 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses the limitation of single-retrieval methods in long-form generation for applications requiring factual accuracy, though it is an incremental improvement over existing retrieval-augmented approaches.

The paper tackles the problem of factual inaccuracies in long-form text generation by large language models, proposing FLARE, an active retrieval method that anticipates future content and retrieves documents when needed, achieving superior or competitive performance on all 4 tested long-form knowledge-intensive generation tasks.

Despite the remarkable ability of large language models (LMs) to comprehend and generate language, they have a tendency to hallucinate and create factually inaccurate output. Augmenting LMs by retrieving information from external knowledge resources is one promising solution. Most existing retrieval augmented LMs employ a retrieve-and-generate setup that only retrieves information once based on the input. This is limiting, however, in more general scenarios involving generation of long texts, where continually gathering information throughout generation is essential. In this work, we provide a generalized view of active retrieval augmented generation, methods that actively decide when and what to retrieve across the course of the generation. We propose Forward-Looking Active REtrieval augmented generation (FLARE), a generic method which iteratively uses a prediction of the upcoming sentence to anticipate future content, which is then utilized as a query to retrieve relevant documents to regenerate the sentence if it contains low-confidence tokens. We test FLARE along with baselines comprehensively over 4 long-form knowledge-intensive generation tasks/datasets. FLARE achieves superior or competitive performance on all tasks, demonstrating the effectiveness of our method. Code and datasets are available at https://github.com/jzbjyb/FLARE.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes