IRAICLLGMay 12, 2024

Bottleneck-Minimal Indexing for Generative Document Retrieval

arXiv:2405.10974v22 citationsh-index: 3ICML
Originality Incremental advance
AI Analysis

This work addresses retrieval efficiency for document search systems, presenting an incremental improvement with a novel theoretical perspective.

The paper tackles the problem of generative document retrieval by analyzing indexing as an information bottleneck and proposes a bottleneck-minimal indexing method, which outperforms previous methods on NQ320K and MARCO datasets.

We apply an information-theoretic perspective to reconsider generative document retrieval (GDR), in which a document $x \in X$ is indexed by $t \in T$, and a neural autoregressive model is trained to map queries $Q$ to $T$. GDR can be considered to involve information transmission from documents $X$ to queries $Q$, with the requirement to transmit more bits via the indexes $T$. By applying Shannon's rate-distortion theory, the optimality of indexing can be analyzed in terms of the mutual information, and the design of the indexes $T$ can then be regarded as a {\em bottleneck} in GDR. After reformulating GDR from this perspective, we empirically quantify the bottleneck underlying GDR. Finally, using the NQ320K and MARCO datasets, we evaluate our proposed bottleneck-minimal indexing method in comparison with various previous indexing methods, and we show that it outperforms those methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes