IRCLMay 20, 2025

Rank-K: Test-Time Reasoning for Listwise Reranking

arXiv:2505.14432v134 citationsh-index: 20
Originality Incremental advance
AI Analysis

This addresses efficiency and effectiveness challenges in retrieval systems for users needing scalable, multilingual query processing, though it is incremental as it builds on existing listwise reranking methods.

The paper tackles the resource-intensive nature of neural rerankers in retrieval pipelines by introducing Rank-K, a listwise passage reranking model that leverages reasoning language models at query time, improving retrieval effectiveness by 23% over the state-of-the-art RankZephyr on BM25 results and 19% on SPLADE-v3 results.

Retrieve-and-rerank is a popular retrieval pipeline because of its ability to make slow but effective rerankers efficient enough at query time by reducing the number of comparisons. Recent works in neural rerankers take advantage of large language models for their capability in reasoning between queries and passages and have achieved state-of-the-art retrieval effectiveness. However, such rerankers are resource-intensive, even after heavy optimization. In this work, we introduce Rank-K, a listwise passage reranking model that leverages the reasoning capability of the reasoning language model at query time that provides test time scalability to serve hard queries. We show that Rank-K improves retrieval effectiveness by 23\% over the RankZephyr, the state-of-the-art listwise reranker, when reranking a BM25 initial ranked list and 19\% when reranking strong retrieval results by SPLADE-v3. Since Rank-K is inherently a multilingual model, we found that it ranks passages based on queries in different languages as effectively as it does in monolingual retrieval.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes