CLJul 17, 2024

Crafting the Path: Robust Query Rewriting for Information Retrieval

arXiv:2407.12529v214 citationsh-index: 4
Originality Incremental advance
AI Analysis

This addresses the issue of query rewriting in information retrieval systems, particularly for scenarios where LLMs lack internal knowledge, offering a robust solution that is less dependent on model parameters.

The paper tackles the problem of query rewriting for information retrieval by proposing a structured method called Crafting the Path, which improves performance over previous methods, especially in less familiar domains for LLMs, with fewer factual inaccuracies.

Query rewriting aims to generate a new query that can complement the original query to improve the information retrieval system. Recent studies on query rewriting, such as query2doc, query2expand and querey2cot, rely on the internal knowledge of Large Language Models (LLMs) to generate a relevant passage to add information to the query. Nevertheless, the efficacy of these methodologies may markedly decline in instances where the requisite knowledge is not encapsulated within the model's intrinsic parameters. In this paper, we propose a novel structured query rewriting method called Crafting the Path tailored for retrieval systems. Crafting the Path involves a three-step process that crafts query-related information necessary for finding the passages to be searched in each step. Specifically, the Crafting the Path begins with Query Concept Comprehension, proceeds to Query Type Identification, and finally conducts Expected Answer Extraction. Experimental results show that our method outperforms previous rewriting methods, especially in less familiar domains for LLMs. We demonstrate that our method is less dependent on the internal parameter knowledge of the model and generates queries with fewer factual inaccuracies. Furthermore, we observe that \name{} demonstrates superior performance in the retrieval-augmented generation scenarios.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes