AIOct 21, 2025

Query Decomposition for RAG: Balancing Exploration-Exploitation

arXiv:2510.18633v13 citationsh-index: 19
Originality Incremental advance
AI Analysis

This addresses the trade-off between broad retrieval and noise reduction in RAG systems for complex user requests, representing an incremental improvement.

The paper tackles the problem of efficiently selecting informative documents in retrieval-augmented generation (RAG) systems by balancing exploration and exploitation in query decomposition, resulting in a 35% gain in document-level precision and a 15% increase in α-nDCG.

Retrieval-augmented generation (RAG) systems address complex user requests by decomposing them into subqueries, retrieving potentially relevant documents for each, and then aggregating them to generate an answer. Efficiently selecting informative documents requires balancing a key trade-off: (i) retrieving broadly enough to capture all the relevant material, and (ii) limiting retrieval to avoid excessive noise and computational cost. We formulate query decomposition and document retrieval in an exploitation-exploration setting, where retrieving one document at a time builds a belief about the utility of a given sub-query and informs the decision to continue exploiting or exploring an alternative. We experiment with a variety of bandit learning methods and demonstrate their effectiveness in dynamically selecting the most informative sub-queries. Our main finding is that estimating document relevance using rank information and human judgments yields a 35% gain in document-level precision, 15% increase in α-nDCG, and better performance on the downstream task of long-form generation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes