CLNov 26, 2019

A Large-scale Dataset for Argument Quality Ranking: Construction and Analysis

Shai Gretz, Roni Friedman, Edo Cohen-Karlik, Assaf Toledo, Dan Lahav, Ranit Aharonov, Noam Slonim

arXiv:1911.11408v17.6157 citations

Originality Synthesis-oriented

AI Analysis

This work addresses the need for better argument quality assessment in computational argumentation, though it is incremental as it builds on existing datasets and methods.

The authors tackled the problem of ranking argument quality by constructing a large-scale dataset of 30,497 annotated arguments, which is five times larger than previous datasets, and developed a neural method that outperforms baselines on this and another dataset.

Identifying the quality of free-text arguments has become an important task in the rapidly expanding field of computational argumentation. In this work, we explore the challenging task of argument quality ranking. To this end, we created a corpus of 30,497 arguments carefully annotated for point-wise quality, released as part of this work. To the best of our knowledge, this is the largest dataset annotated for point-wise argument quality, larger by a factor of five than previously released datasets. Moreover, we address the core issue of inducing a labeled score from crowd annotations by performing a comprehensive evaluation of different approaches to this problem. In addition, we analyze the quality dimensions that characterize this dataset. Finally, we present a neural method for argument quality ranking, which outperforms several baselines on our own dataset, as well as previous methods published for another dataset.

View on arXiv PDF

Similar