IRLGPFOct 18, 2025

Blending Learning to Rank and Dense Representations for Efficient and Effective Cascades

arXiv:2510.16393v1h-index: 26
Originality Incremental advance
AI Analysis

This work improves retrieval effectiveness for search systems by combining dense and lexical features, though it is incremental as it builds on existing dense retrievers and LTR methods.

The paper tackles ad-hoc passage retrieval by blending lexical and neural relevance signals using a Learning-to-Rank model, achieving up to an 11% boost in nDCG@10 with only a 4.3% increase in query latency.

We investigate the exploitation of both lexical and neural relevance signals for ad-hoc passage retrieval. Our exploration involves a large-scale training dataset in which dense neural representations of MS-MARCO queries and passages are complemented and integrated with 253 hand-crafted lexical features extracted from the same corpus. Blending of the relevance signals from the two different groups of features is learned by a classical Learning-to-Rank (LTR) model based on a forest of decision trees. To evaluate our solution, we employ a pipelined architecture where a dense neural retriever serves as the first stage and performs a nearest-neighbor search over the neural representations of the documents. Our LTR model acts instead as the second stage that re-ranks the set of candidates retrieved by the first stage to enhance effectiveness. The results of reproducible experiments conducted with state-of-the-art dense retrievers on publicly available resources show that the proposed solution significantly enhances the end-to-end ranking performance while relatively minimally impacting efficiency. Specifically, we achieve a boost in nDCG@10 of up to 11% with an increase in average query latency of only 4.3%. This confirms the advantage of seamlessly combining two distinct families of signals that mutually contribute to retrieval effectiveness.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes