CLIROct 17, 2024

Knowledge-Aware Query Expansion with Large Language Models for Textual and Relational Retrieval

arXiv:2410.13765v233 citationsh-index: 18NAACL
Originality Highly original
AI Analysis

This addresses the limitation of existing query expansion methods that overlook document relations, improving retrieval for semi-structured queries in domains like product search.

The paper tackles the problem of semi-structured queries requiring both textual and relational information by proposing a knowledge-aware query expansion framework that augments large language models with structured document relations from a knowledge graph, achieving advantages over state-of-the-art baselines on three diverse datasets.

Large language models (LLMs) have been used to generate query expansions augmenting original queries for improving information search. Recent studies also explore providing LLMs with initial retrieval results to generate query expansions more grounded to document corpus. However, these methods mostly focus on enhancing textual similarities between search queries and target documents, overlooking document relations. For queries like "Find me a highly rated camera for wildlife photography compatible with my Nikon F-Mount lenses", existing methods may generate expansions that are semantically similar but structurally unrelated to user intents. To handle such semi-structured queries with both textual and relational requirements, in this paper we propose a knowledge-aware query expansion framework, augmenting LLMs with structured document relations from knowledge graph (KG). To further address the limitation of entity-based scoring in existing KG-based methods, we leverage document texts as rich KG node representations and use document-based relation filtering for our Knowledge-Aware Retrieval (KAR). Extensive experiments on three datasets of diverse domains show the advantages of our method compared against state-of-the-art baselines on textual and relational semi-structured retrieval.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes