CLAug 30, 2018

Comparative Studies of Detecting Abusive Language on Twitter

Younghun Lee, Seunghyun Yoon, Kyomin Jung

arXiv:1808.10245v132.11105 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This work addresses the challenge of abusive language detection for social media platforms, but it is incremental as it applies existing methods to a new dataset.

The paper tackles the problem of detecting abusive language on Twitter by conducting the first comparative study of various learning models on the Hate and Abusive Speech on Twitter dataset, finding that a bidirectional GRU network with Latent Topic Clustering achieves the highest accuracy with an F1 score of 0.805.

The context-dependent nature of online aggression makes annotating large collections of data extremely difficult. Previously studied datasets in abusive language detection have been insufficient in size to efficiently train deep learning models. Recently, Hate and Abusive Speech on Twitter, a dataset much greater in size and reliability, has been released. However, this dataset has not been comprehensively studied to its potential. In this paper, we conduct the first comparative study of various learning models on Hate and Abusive Speech on Twitter, and discuss the possibility of using additional features and context data for improvements. Experimental results show that bidirectional GRU networks trained on word-level features, with Latent Topic Clustering modules, is the most accurate model scoring 0.805 F1.

View on arXiv PDF Code

Similar