IRAICLApr 28, 2024

CLARINET: Augmenting Language Models to Ask Clarification Questions for Retrieval

arXiv:2405.15784v114 citationsh-index: 3
Originality Incremental advance
AI Analysis

This addresses the challenge of ambiguous queries in retrieval systems for users, representing an incremental advance by augmenting LLMs with retrieval conditioning.

The paper tackles the problem of ambiguous user requests in information retrieval by developing CLARINET, a system that asks clarification questions to improve retrieval accuracy, resulting in a 17% improvement over traditional heuristics and a 39% improvement over vanilla-prompted LLMs on a book search dataset.

Users often make ambiguous requests that require clarification. We study the problem of asking clarification questions in an information retrieval setting, where systems often face ambiguous search queries and it is challenging to turn the uncertainty in the retrieval model into a natural language question. We present CLARINET, a system that asks informative clarification questions by choosing questions whose answers would maximize certainty in the correct candidate. Our approach works by augmenting a large language model (LLM) to condition on a retrieval distribution, finetuning end-to-end to generate the question that would have maximized the rank of the true candidate at each turn. When evaluated on a real-world retrieval dataset of users searching for books, our system outperforms traditional heuristics such as information gain on retrieval success by 17% and vanilla-prompted LLMs by 39% relative.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes