IR AIApr 9, 2025

Beyond Reproducibility: Advancing Zero-shot LLM Reranking Efficiency with Setwise Insertion

Jakub Podolak, Leon Peric, Mina Janicijevic, Roxana Petcu

arXiv:2504.10509v111.88 citationsh-index: 1SIGIR

Originality Incremental advance

AI Analysis

This addresses efficiency bottlenecks in zero-shot document reranking for LLM applications, though it is incremental.

This study reproduced and extended the Setwise prompting method for zero-shot LLM reranking, confirming its trade-offs between efficiency and effectiveness, and introduced Setwise Insertion, which reduced query time by 31% and model inferences by 23% while slightly improving reranking effectiveness.

This study presents a comprehensive reproducibility and extension analysis of the Setwise prompting methodology for zero-shot ranking with Large Language Models (LLMs), as proposed by Zhuang et al. We evaluate its effectiveness and efficiency compared to traditional Pointwise, Pairwise, and Listwise approaches in document ranking tasks. Our reproduction confirms the findings of Zhuang et al., highlighting the trade-offs between computational efficiency and ranking effectiveness in Setwise methods. Building on these insights, we introduce Setwise Insertion, a novel approach that leverages the initial document ranking as prior knowledge, reducing unnecessary comparisons and uncertainty by focusing on candidates more likely to improve the ranking results. Experimental results across multiple LLM architectures (Flan-T5, Vicuna, and Llama) show that Setwise Insertion yields a 31% reduction in query time, a 23% reduction in model inferences, and a slight improvement in reranking effectiveness compared to the original Setwise method. These findings highlight the practical advantage of incorporating prior ranking knowledge into Setwise prompting for efficient and accurate zero-shot document reranking.

View on arXiv PDF

Similar