CLNov 16, 2017

Crowdsourcing Question-Answer Meaning Representations

Julian Michael, Gabriel Stanovsky, Luheng He, Ido Dagan, Luke Zettlemoyer

arXiv:1711.05885v139.61124 citationsHas Code

Originality Incremental advance

AI Analysis

This provides a new crowdsourced dataset for natural language processing researchers to model complex predicate-argument phenomena, though it is incremental in building on existing representation methods.

The authors tackled the problem of representing predicate-argument structures in sentences by introducing Question-Answer Meaning Representations (QAMRs) as sets of question-answer pairs, and they crowdsourced a dataset with over 5,000 sentences and 100,000 questions, showing that QAMRs cover most relationships in existing datasets and many under-resourced ones.

We introduce Question-Answer Meaning Representations (QAMRs), which represent the predicate-argument structure of a sentence as a set of question-answer pairs. We also develop a crowdsourcing scheme to show that QAMRs can be labeled with very little training, and gather a dataset with over 5,000 sentences and 100,000 questions. A detailed qualitative analysis demonstrates that the crowd-generated question-answer pairs cover the vast majority of predicate-argument relationships in existing datasets (including PropBank, NomBank, QA-SRL, and AMR) along with many previously under-resourced ones, including implicit arguments and relations. The QAMR data and annotation code is made publicly available to enable future work on how best to model these complex phenomena.

View on arXiv PDF Code

Similar