CLJan 11, 2016

Argumentation Mining in User-Generated Web Discourse

arXiv:1601.02403v5301 citations

AI Analysis

This work addresses the problem of analyzing arguments in diverse web data for computational linguistics, but it is incremental as it builds on existing methods.

The paper tackled argumentation mining in noisy, user-generated web discourse by creating a new gold standard corpus and adapting an argumentation model, showing it is feasible but challenging.

The goal of argumentation mining, an evolving research field in computational linguistics, is to design methods capable of analyzing people's argumentation. In this article, we go beyond the state of the art in several ways. (i) We deal with actual Web data and take up the challenges given by the variety of registers, multiple domains, and unrestricted noisy user-generated Web discourse. (ii) We bridge the gap between normative argumentation theories and argumentation phenomena encountered in actual data by adapting an argumentation model tested in an extensive annotation study. (iii) We create a new gold standard corpus (90k tokens in 340 documents) and experiment with several machine learning methods to identify argument components. We offer the data, source codes, and annotation guidelines to the community under free licenses. Our findings show that argumentation mining in user-generated Web discourse is a feasible but challenging task.

View on arXiv PDF

Similar