CLNov 16, 2017

Crowdsourcing Question-Answer Meaning Representations

arXiv:1711.05885v11124 citations
Originality Incremental advance
AI Analysis

This provides a new crowdsourced dataset for natural language processing researchers to model complex predicate-argument phenomena, though it is incremental in building on existing representation methods.

The authors tackled the problem of representing predicate-argument structures in sentences by introducing Question-Answer Meaning Representations (QAMRs) as sets of question-answer pairs, and they crowdsourced a dataset with over 5,000 sentences and 100,000 questions, showing that QAMRs cover most relationships in existing datasets and many under-resourced ones.

We introduce Question-Answer Meaning Representations (QAMRs), which represent the predicate-argument structure of a sentence as a set of question-answer pairs. We also develop a crowdsourcing scheme to show that QAMRs can be labeled with very little training, and gather a dataset with over 5,000 sentences and 100,000 questions. A detailed qualitative analysis demonstrates that the crowd-generated question-answer pairs cover the vast majority of predicate-argument relationships in existing datasets (including PropBank, NomBank, QA-SRL, and AMR) along with many previously under-resourced ones, including implicit arguments and relations. The QAMR data and annotation code is made publicly available to enable future work on how best to model these complex phenomena.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes