CLMay 19, 2025

GuRE:Generative Query REwriter for Legal Passage Retrieval

arXiv:2505.12950v24 citationsh-index: 3Has CodeProceedings of the Natural Legal Language Processing Workshop 2025
Originality Incremental advance
AI Analysis

This addresses a specific problem for legal practitioners by enhancing retrieval efficiency, though it is incremental as it applies existing LLM techniques to a new domain.

The paper tackles the vocabulary mismatch problem in Legal Passage Retrieval (LPR) by proposing GuRE, a generative query rewriter using LLMs, which significantly improves retrieval performance in a retriever-agnostic manner and outperforms all baseline methods.

Legal Passage Retrieval (LPR) systems are crucial as they help practitioners save time when drafting legal arguments. However, it remains an underexplored avenue. One primary reason is the significant vocabulary mismatch between the query and the target passage. To address this, we propose a simple yet effective method, the Generative query REwriter (GuRE). We leverage the generative capabilities of Large Language Models (LLMs) by training the LLM for query rewriting. "Rewritten queries" help retrievers to retrieve target passages by mitigating vocabulary mismatch. Experimental results show that GuRE significantly improves performance in a retriever-agnostic manner, outperforming all baseline methods. Further analysis reveals that different training objectives lead to distinct retrieval behaviors, making GuRE more suitable than direct retriever fine-tuning for real-world applications. Codes are avaiable at github.com/daehuikim/GuRE.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes