CL AI IRJul 15, 2019

Asking Clarifying Questions in Open-Domain Information-Seeking Conversations

Mohammad Aliannejadi, Hamed Zamani, Fabio Crestani, W. Bruce Croft

arXiv:1907.06554v1401 citations

Originality Incremental advance

AI Analysis

This addresses the issue of user frustration in open-domain information-seeking conversations, offering a novel approach to improve system effectiveness, though it is incremental in building on existing datasets and methods.

The paper tackles the problem of users struggling to express complex information needs in conversational search systems by proposing a framework for asking clarifying questions, which leads to over 170% improvement in retrieval performance (P@1) with just one good question.

Users often fail to formulate their complex information needs in a single query. As a consequence, they may need to scan multiple result pages or reformulate their queries, which may be a frustrating experience. Alternatively, systems can improve user satisfaction by proactively asking questions of the users to clarify their information needs. Asking clarifying questions is especially important in conversational systems since they can only return a limited number of (often only one) result(s). In this paper, we formulate the task of asking clarifying questions in open-domain information-seeking conversational systems. To this end, we propose an offline evaluation methodology for the task and collect a dataset, called Qulac, through crowdsourcing. Our dataset is built on top of the TREC Web Track 2009-2012 data and consists of over 10K question-answer pairs for 198 TREC topics with 762 facets. Our experiments on an oracle model demonstrate that asking only one good question leads to over 170% retrieval performance improvement in terms of P@1, which clearly demonstrates the potential impact of the task. We further propose a retrieval framework consisting of three components: question retrieval, question selection, and document retrieval. In particular, our question selection model takes into account the original query and previous question-answer interactions while selecting the next question. Our model significantly outperforms competitive baselines. To foster research in this area, we have made Qulac publicly available.

View on arXiv PDF

Similar