IR AI CLApr 28, 2024

CLARINET: Augmenting Language Models to Ask Clarification Questions for Retrieval

Yizhou Chi, Jessy Lin, Kevin Lin, Dan Klein

arXiv:2405.15784v114.814 citationsh-index: 3

Originality Incremental advance

AI Analysis

This addresses the challenge of ambiguous queries in retrieval systems for users, representing an incremental advance by augmenting LLMs with retrieval conditioning.

The paper tackles the problem of ambiguous user requests in information retrieval by developing CLARINET, a system that asks clarification questions to improve retrieval accuracy, resulting in a 17% improvement over traditional heuristics and a 39% improvement over vanilla-prompted LLMs on a book search dataset.

Users often make ambiguous requests that require clarification. We study the problem of asking clarification questions in an information retrieval setting, where systems often face ambiguous search queries and it is challenging to turn the uncertainty in the retrieval model into a natural language question. We present CLARINET, a system that asks informative clarification questions by choosing questions whose answers would maximize certainty in the correct candidate. Our approach works by augmenting a large language model (LLM) to condition on a retrieval distribution, finetuning end-to-end to generate the question that would have maximized the rank of the true candidate at each turn. When evaluated on a real-world retrieval dataset of users searching for books, our system outperforms traditional heuristics such as information gain on retrieval success by 17% and vanilla-prompted LLMs by 39% relative.

View on arXiv PDF

Similar