CLNov 26, 2019

A Large-scale Dataset for Argument Quality Ranking: Construction and Analysis

arXiv:1911.11408v1153 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the need for better argument quality assessment in computational argumentation, though it is incremental as it builds on existing datasets and methods.

The authors tackled the problem of ranking argument quality by constructing a large-scale dataset of 30,497 annotated arguments, which is five times larger than previous datasets, and developed a neural method that outperforms baselines on this and another dataset.

Identifying the quality of free-text arguments has become an important task in the rapidly expanding field of computational argumentation. In this work, we explore the challenging task of argument quality ranking. To this end, we created a corpus of 30,497 arguments carefully annotated for point-wise quality, released as part of this work. To the best of our knowledge, this is the largest dataset annotated for point-wise argument quality, larger by a factor of five than previously released datasets. Moreover, we address the core issue of inducing a labeled score from crowd annotations by performing a comprehensive evaluation of different approaches to this problem. In addition, we analyze the quality dimensions that characterize this dataset. Finally, we present a neural method for argument quality ranking, which outperforms several baselines on our own dataset, as well as previous methods published for another dataset.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes