CL | Nov 14, 2022

Why Did the Chicken Cross the Road? Rephrasing and Analyzing Ambiguous Questions in VQA

arXiv:2211.07516v2231 citations | h-index: 60
Originality: Incremental advance
AI Analysis

This work addresses ambiguity resolution in visual question answering (VQA), which matters for AI systems that must handle natural-language queries. The advance is incremental because it builds on existing VQA methods.

The paper tackles ambiguous questions in visual question answering in three steps: it creates a dataset of ambiguous examples, analyzes the reasons for ambiguity, and develops a question-generation model that produces less ambiguous questions. The model is validated through automatic and human evaluation.

Natural language is ambiguous. Resolving ambiguous questions is key to successfully answering them. Focusing on questions about images, we create a dataset of ambiguous examples. We annotate these, grouping answers by the underlying question they address and rephrasing the question for each group to reduce ambiguity. Our analysis reveals a linguistically-aligned ontology of reasons for ambiguity in visual questions. We then develop an English question-generation model which we demonstrate via automatic and human evaluation produces less ambiguous questions. We further show that the question generation objective we use allows the model to integrate answer group information without any direct supervision.
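
To make the annotation scheme in the abstract concrete, here is a minimal sketch of how answers might be grouped by the underlying question they address. The schema (`AnswerGroup`, `group_answers`, `is_ambiguous`) and the toy annotations are assumptions made up for illustration; they are not taken from the paper's dataset or code.

```python
from collections import defaultdict
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class AnswerGroup:
    """Answers that address the same underlying question (hypothetical schema)."""
    rephrased_question: str
    answers: List[str] = field(default_factory=list)


def group_answers(annotations: List[Tuple[str, str]]) -> List[AnswerGroup]:
    """Group (answer, rephrased_question) pairs by their rephrased question."""
    buckets: Dict[str, List[str]] = defaultdict(list)
    for answer, rephrased in annotations:
        buckets[rephrased].append(answer)
    # dicts preserve insertion order, so groups appear in first-seen order
    return [AnswerGroup(question, answers) for question, answers in buckets.items()]


def is_ambiguous(groups: List[AnswerGroup]) -> bool:
    """Treat a question as ambiguous when its answers fall into more than one group."""
    return len(groups) > 1


# Toy, invented annotations for the question in the paper's title.
GOAL = "What was the chicken's goal in crossing the road?"
CAUSE = "What caused the chicken to cross the road?"
toy_annotations = [
    ("to get to the other side", GOAL),
    ("to reach the other side", GOAL),
    ("a car scared it", CAUSE),
]

groups = group_answers(toy_annotations)
for group in groups:
    print(group.rephrased_question, "->", group.answers)
print("ambiguous:", is_ambiguous(groups))
```

In this sketch each rephrased question stands in for one reading of the original question, so a question whose answers all land in a single group would not count as ambiguous.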

Code Implementations: 1 repo
