Hypothesis-Conditioned Query Rewriting for Decision-Useful Retrieval

Hangeol Chang, Changsun Lee, Seungjoon Rho, Junho Yeo, Jong Chul Ye

arXiv:2603.1900895.5h-index: 2Has Code

Predicted impact top 11% in CL · last 90 daysOriginality Incremental advance

AI Analysis

This addresses a bottleneck in RAG systems for decision-making tasks, offering a domain-specific improvement for applications like medical QA.

The paper tackles the problem of retrieval-augmented generation (RAG) failing to provide decision-relevant evidence for tasks with competing options, by proposing Hypothesis-Conditioned Query Rewriting (HCQR), a training-free framework that rewrites queries to retrieve targeted evidence, resulting in accuracy improvements of 5.9 and 3.6 points over Simple RAG on MedQA and MMLU-Med datasets.

Retrieval-Augmented Generation (RAG) improves Large Language Models (LLMs) by grounding generation in external, non-parametric knowledge. However, when a task requires choosing among competing options, simply grounding generation in broadly relevant context is often insufficient to drive the final decision. Existing RAG methods typically rely on a single initial query, which often favors topical relevance over decision-relevant evidence, and therefore retrieves background information that can fail to discriminate among answer options. To address this issue, here we propose Hypothesis-Conditioned Query Rewriting (HCQR), a training-free pre-retrieval framework that reorients RAG from topic-oriented retrieval to evidence-oriented retrieval. HCQR first derives a lightweight working hypothesis from the input question and candidate options, and then rewrites retrieval into three targeted queries that seek evidence to: (1) support the hypothesis, (2) distinguish it from competing alternatives, and (3) verify salient clues in the question. This approach enables context retrieval that is more directly aligned with answer selection, allowing the generator to confirm or overturn the initial hypothesis based on the retrieved evidence. Experiments on MedQA and MMLU-Med show that HCQR consistently outperforms single-query RAG and re-rank/filter baselines, improving average accuracy over Simple RAG by 5.9 and 3.6 points, respectively. Code is available at https://anonymous.4open.science/r/HCQR-1C2E.

View on arXiv PDF

Similar