CLIRApr 10, 2025

Plan-and-Refine: Diverse and Comprehensive Retrieval-Augmented Generation

arXiv:2504.07794v1h-index: 12Has Code
Originality Highly original
AI Analysis

It addresses limitations in response diversity and comprehensiveness for information-seeking tasks, representing a novel method for a known bottleneck.

This paper tackles the problem of generating diverse and comprehensive responses in retrieval-augmented LLMs by introducing the Plan-and-Refine framework, which achieves up to 13.1% and 15.41% improvements on benchmarks.

This paper studies the limitations of (retrieval-augmented) large language models (LLMs) in generating diverse and comprehensive responses, and introduces the Plan-and-Refine (P&R) framework based on a two phase system design. In the global exploration phase, P&R generates a diverse set of plans for the given input, where each plan consists of a list of diverse query aspects with corresponding additional descriptions. This phase is followed by a local exploitation phase that generates a response proposal for the input query conditioned on each plan and iteratively refines the proposal for improving the proposal quality. Finally, a reward model is employed to select the proposal with the highest factuality and coverage. We conduct our experiments based on the ICAT evaluation methodology--a recent approach for answer factuality and comprehensiveness evaluation. Experiments on the two diverse information seeking benchmarks adopted from non-factoid question answering and TREC search result diversification tasks demonstrate that P&R significantly outperforms baselines, achieving up to a 13.1% improvement on the ANTIQUE dataset and a 15.41% improvement on the TREC dataset. Furthermore, a smaller scale user study confirms the substantial efficacy of the P&R framework.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes