CLAIApr 22, 2020

AmbigQA: Answering Ambiguous Open-domain Questions

arXiv:2004.10645v21116 citations
AI Analysis

It addresses ambiguity in open-domain QA, a common issue when exploring new topics, by providing a new task and dataset to support future research.

The paper introduces AmbigQA, a task for answering ambiguous open-domain questions by finding all plausible answers and rewriting questions to resolve ambiguity, and constructs AmbigNQ, a dataset of 14,042 questions showing over half are ambiguous.

Ambiguity is inherent to open-domain question answering; especially when exploring new topics, it can be difficult to ask questions that have a single, unambiguous answer. In this paper, we introduce AmbigQA, a new open-domain question answering task which involves finding every plausible answer, and then rewriting the question for each one to resolve the ambiguity. To study this task, we construct AmbigNQ, a dataset covering 14,042 questions from NQ-open, an existing open-domain QA benchmark. We find that over half of the questions in NQ-open are ambiguous, with diverse sources of ambiguity such as event and entity references. We also present strong baseline models for AmbigQA which we show benefit from weakly supervised learning that incorporates NQ-open, strongly suggesting our new task and data will support significant future research effort. Our data and baselines are available at https://nlp.cs.washington.edu/ambigqa.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes