CLJan 23

DF-RAG: Query-Aware Diversity for Retrieval-Augmented Generation

arXiv:2601.17212v12 citationsh-index: 3
Originality Incremental advance
AI Analysis

This addresses a bottleneck in RAG systems for complex QA tasks, though it is incremental as it builds on existing frameworks.

The paper tackles the problem of redundant content in retrieval-augmented generation (RAG) for reasoning-intensive question-answering by introducing DF-RAG, which incorporates query-aware diversity into retrieval, resulting in 4-10 percent F1 performance improvements over vanilla RAG.

Retrieval-augmented generation (RAG) is a common technique for grounding language model outputs in domain-specific information. However, RAG is often challenged by reasoning-intensive question-answering (QA), since common retrieval methods like cosine similarity maximize relevance at the cost of introducing redundant content, which can reduce information recall. To address this, we introduce Diversity-Focused Retrieval-Augmented Generation (DF-RAG), which systematically incorporates diversity into the retrieval step to improve performance on complex, reasoning-intensive QA benchmarks. DF-RAG builds upon the Maximal Marginal Relevance framework to select information chunks that are both relevant to the query and maximally dissimilar from each other. A key innovation of DF-RAG is its ability to optimize the level of diversity for each query dynamically at test time without requiring any additional fine-tuning or prior information. We show that DF-RAG improves F1 performance on reasoning-intensive QA benchmarks by 4-10 percent over vanilla RAG using cosine similarity and also outperforms other established baselines. Furthermore, we estimate an Oracle ceiling of up to 18 percent absolute F1 gains over vanilla RAG, of which DF-RAG captures up to 91.3 percent.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes