AISYSYMay 26

Natural Language Query to Configuration for Retrieval Agents

arXiv:2605.2736185.0
Predicted impact top 28% in AI · last 90 daysOriginality Incremental advance
AI Analysis

For practitioners deploying retrieval agents, BRANE offers a practical per-query optimization method that outperforms static tuning and existing routing baselines.

BRANE selects per-query retrieval pipeline configurations (LLM, retriever, etc.) to minimize cost or maximize accuracy, achieving up to 89% cost reduction while matching best fixed-configuration accuracy across three benchmarks.

Modern retrieval agents expose many configuration choices -- LLM, retriever, number of documents, number of hops, and synthesis strategy -- each shaping both answer quality and serving cost. Today, these pipelines are typically hand-tuned once per workload, leaving substantial per-query optimization untapped. We formulate the problem: given a natural-language query and either an accuracy or a budget target, select from a predefined pipeline catalog the configuration that minimizes cost or maximizes accuracy at inference time. We propose **BRANE**, which uses an LLM to convert each query into workload-specific characteristics, then trains a lightweight per-configuration predictor that estimates whether the pipeline will answer the query correctly. At inference time, **BRANE** selects the configuration that maximizes predicted correctness penalized by cost, exposing a tunable cost-quality tradeoff without retraining. Across MuSiQue, BrowseComp-Plus, and FinanceBench, **BRANE** consistently pushes the cost-quality Pareto frontier, matches the best fixed configuration's accuracy at up to 89% lower cost, and outperforms LLM-routing, rule-based, and fine-tuned Qwen3-4B baselines. These results show that per-query configuration of the full retrieval pipeline is a practical alternative to static workload-level tuning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes