IRCLJun 3, 2024

A Survey of Generative Information Retrieval

arXiv:2406.01197v25 citationsHas Code
AI Analysis

It provides a foundational overview for researchers in information retrieval, but is incremental as it surveys existing work rather than introducing new methods.

This survey tackles the emerging paradigm of Generative Retrieval (GR), which uses generative models to directly map queries to document identifiers, bypassing traditional retrieval steps, and outlines future research directions to advance the field.

Generative Retrieval (GR) is an emerging paradigm in information retrieval that leverages generative models to directly map queries to relevant document identifiers (DocIDs) without the need for traditional query processing or document reranking. This survey provides a comprehensive overview of GR, highlighting key developments, indexing and retrieval strategies, and challenges. We discuss various document identifier strategies, including numerical and string-based identifiers, and explore different document representation methods. Our primary contribution lies in outlining future research directions that could profoundly impact the field: improving the quality of query generation, exploring learnable document identifiers, enhancing scalability, and integrating GR with multi-task learning frameworks. By examining state-of-the-art GR techniques and their applications, this survey aims to provide a foundational understanding of GR and inspire further innovations in this transformative approach to information retrieval. We also make the complementary materials such as paper collection publicly available at https://github.com/MiuLab/GenIR-Survey/

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes