CLAIMar 9, 2023

Can a Frozen Pretrained Language Model be used for Zero-shot Neural Retrieval on Entity-centric Questions?

arXiv:2303.05153v1h-index: 14
Originality Incremental advance
AI Analysis

This addresses the generalization issue in retrieval for entity-centric questions in Wikipedia, offering a zero-shot approach that avoids fine-tuning, though it is incremental as it builds on existing methods.

The paper tackled the problem of neural document retrievers, like DPR, struggling to generalize on entity-centric questions in Wikipedia, and proposed a zero-shot method using frozen pretrained language model embeddings, which outperformed DPR and achieved performance comparable to BM25 and SPAR on such questions.

Neural document retrievers, including dense passage retrieval (DPR), have outperformed classical lexical-matching retrievers, such as BM25, when fine-tuned and tested on specific question-answering datasets. However, it has been shown that the existing dense retrievers do not generalize well not only out of domain but even in domain such as Wikipedia, especially when a named entity in a question is a dominant clue for retrieval. In this paper, we propose an approach toward in-domain generalization using the embeddings generated by the frozen language model trained with the entities in the domain. By not fine-tuning, we explore the possibility that the rich knowledge contained in a pretrained language model can be used for retrieval tasks. The proposed method outperforms conventional DPRs on entity-centric questions in Wikipedia domain and achieves almost comparable performance to BM25 and state-of-the-art SPAR model. We also show that the contextualized keys lead to strong improvements compared to BM25 when the entity names consist of common words. Our results demonstrate the feasibility of the zero-shot retrieval method for entity-centric questions of Wikipedia domain, where DPR has struggled to perform.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes