IRCLLGApr 3

Lightweight Query Routing for Adaptive RAG: A Baseline Study on RAGRouter-Bench

arXiv:2604.0345538.7h-index: 3
Predicted impact top 64% in IR · last 90 daysOriginality Synthesis-oriented
AI Analysis

It establishes a strong baseline for query routing in RAG systems, showing that simple lexical features outperform semantic embeddings, which is important for practical efficiency.

The paper evaluates lightweight classifiers for routing queries to appropriate RAG strategies on the RAGRouter-Bench benchmark, achieving 93.2% accuracy and 28.1% token savings with TF-IDF + SVM.

Retrieval-Augmented Generation pipelines span a wide range of retrieval strategies that differ substantially in token cost and capability. Selecting the right strategy per query is a practical efficiency problem, yet no routing classifiers have been trained on RAGRouter-Bench \citep{wang2026ragrouterbench}, a recently released benchmark of $7,727$ queries spanning four knowledge domains, each annotated with one of three canonical query types: factual, reasoning, and summarization. We present the first systematic evaluation of lightweight classifier-based routing on this benchmark. Five classical classifiers are evaluated under three feature regimes, namely, TF-IDF, MiniLM sentence embeddings \citep{reimers2019sbert}, and hand-crafted structural features, yielding 15 classifier feature combinations. Our best configuration, TF-IDF with an SVM, achieves a macro-averaged F1 of $\mathbf{0.928}$ and an accuracy of $\mathbf{93.2\%}$, while simulating $\mathbf{28.1\%}$ token savings relative to always using the most expensive paradigm. Lexical TF-IDF features outperform semantic sentence embeddings by $3.1$ macro-F1 points, suggesting that surface keyword patterns are strong predictors of query-type complexity. Domain-level analysis reveals that medical queries are hardest to route and legal queries most tractable. These results establish a reproducible query-side baseline and highlight the gap that corpus-aware routing must close.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes