IRCLLGSep 26, 2024

Few-shot Prompting for Pairwise Ranking: An Effective Non-Parametric Retrieval Model

arXiv:2409.17745v324 citationsh-index: 7
AI Analysis

This work addresses the problem of simplifying retrieval pipelines for researchers and practitioners by reducing the need for extensive training while maintaining competitive performance, though it is incremental as it builds on existing few-shot methods.

The paper tackled the performance gap between zero-shot and supervised ranking models by introducing a pairwise few-shot ranker that uses training examples to improve preference prediction, achieving consistent improvements over zero-shot baselines on TREC DL and BEIR benchmarks and close performance to supervised models without complex training.

A supervised ranking model, despite its advantage of being effective, usually involves complex processing - typically multiple stages of task-specific pre-training and fine-tuning. This has motivated researchers to explore simpler pipelines leveraging large language models (LLMs) that are capable of working in a zero-shot manner. However, since zero-shot inference does not make use of a training set of pairs of queries and their relevant documents, its performance is mostly worse than that of supervised models, which are trained on such example pairs. Motivated by the existing findings that training examples generally improve zero-shot performance, in our work, we explore if this also applies to ranking models. More specifically, given a query and a pair of documents, the preference prediction task is improved by augmenting examples of preferences for similar queries from a training set. Our proposed pairwise few-shot ranker demonstrates consistent improvements over the zero-shot baseline on both in-domain (TREC DL) and out-domain (BEIR subset) retrieval benchmarks. Our method also achieves a close performance to that of a supervised model without requiring any complex training pipeline.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes