Few-shot Reranking for Multi-hop QA via Language Model Prompting
This addresses the need for efficient retrieval in multi-hop QA with minimal labeled data, offering a competitive alternative to methods requiring thousands of examples.
The paper tackles the problem of few-shot reranking for multi-hop question answering by proposing PromptRank, which uses language model prompting to score document paths, achieving 73.6 recall@10 on HotpotQA with only 128 training examples.
We study few-shot reranking for multi-hop QA with open-domain questions. To alleviate the need for a large number of labeled question-document pairs for retriever training, we propose PromptRank, which relies on large language models prompting for multi-hop path reranking. PromptRank first constructs an instruction-based prompt that includes a candidate document path and then computes the relevance score between a given question and the path based on the conditional likelihood of the question given the path prompt according to a language model. PromptRank yields strong retrieval performance on HotpotQA with only 128 training examples compared to state-of-the-art methods trained on thousands of examples -- 73.6 recall@10 by PromptRank vs. 77.8 by PathRetriever and 77.5 by multi-hop dense retrieval. Code available at https://github.com/mukhal/PromptRank