CLNov 1, 2024

PrefRAG: Preference-Driven Multi-Source Retrieval Augmented Generation

Tsinghua
arXiv:2411.00689v21 citationsh-index: 17Has Code
Originality Incremental advance
AI Analysis

This addresses the challenge of effectively and controllably exploring different retrieval sources in RAG systems for improving LLM responses, representing an incremental advance in adaptive RAG methods.

The paper tackles the problem of multi-source retrieval-augmented generation (RAG) by introducing PrefRAG, a system that enables controllable exploration of diverse retrieval sources, resulting in performance improvements of up to 25.6% over Vanilla RAG and 13.9% over MS-ARAG.

Retrieval-Augmented Generation (RAG) has emerged as a reliable external knowledge augmentation technique to mitigate hallucination issues and parameterized knowledge limitations in Large Language Models (LLMs). Existing adaptive RAG (ARAG) systems excel at in-depth exploration within a single source but struggle to effectively and controllably explore different retrieval sources, as they fail to foresee their internal knowledge features. We develop a novel multi-source ARAG system, PrefRAG, which enhances RAG by enabling in-depth and controllable exploration of diverse retrieval sources through preference-driven adaptive retrieval and self-reflection. PrefRAG first fully explores controllable local sources in adaptive retrieval and supplements with the web when appropriate, ultimately selecting the optimal source for knowledge observation. Subsequently, PrefRAG feeds answer quality feedback into the retrieval process, optimizing it from the generation perspective to produce higher-quality responses. Extensive experiments confirm its superiority, high retrieval efficiency, and knowledge controllability. PrefRAG outperforms Vanilla RAG and the leading MS-ARAG by up to 25.6% and 13.9% respectively. Additionally, PrefRAG trained with DPO achieves higher performance. The code and data are available at https://github.com/QingFei1/PrefRAG.git.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes