IRApr 12

Why These Documents? Explainable Generative Retrieval with Hierarchical Category Paths

arXiv:2411.0557220.11 citationsh-index: 17
Predicted impact top 33% in IR · last 90 daysOriginality Incremental advance
AI Analysis

For users of generative retrieval systems, HyPE addresses the lack of explainability by offering step-by-step category paths as explanations.

HyPE enhances generative retrieval by first generating hierarchical category paths before decoding document identifiers, providing explainability for retrieval decisions while also improving retrieval performance.

Generative retrieval directly decode a document identifier (i.e., docid) in response to a query, making it impossible to provide users with explanations as an answer for ``why is this document retrieved?''. To address this limitation, we propose Hierarchical Category Path-Enhanced Generative Retrieval (HyPE), which enhances explainability by first generating hierarchical category paths step-by-step then decoding docid. By leveraging hierarchical category paths which progress from broader to more specific semantic categories, HyPE can provide detailed explanation for its retrieval decision. For training, HyPE constructs category paths with external high-quality semantic hierarchy, leverages LLM to select appropriate candidate paths for each document, and optimizes the generative retrieval model with path-augmented dataset. During inference, HyPE utilizes path-aware ranking strategy to aggregate diverse topic information, allowing the most relevant documents to be prioritized in the final ranked list of docids. Our extensive experiments demonstrate that HyPE not only offers a high level of explainability but also improves the retrieval performance.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes