CLAug 30, 2018

Comparative Studies of Detecting Abusive Language on Twitter

arXiv:1808.10245v11105 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of abusive language detection for social media platforms, but it is incremental as it applies existing methods to a new dataset.

The paper tackles the problem of detecting abusive language on Twitter by conducting the first comparative study of various learning models on the Hate and Abusive Speech on Twitter dataset, finding that a bidirectional GRU network with Latent Topic Clustering achieves the highest accuracy with an F1 score of 0.805.

The context-dependent nature of online aggression makes annotating large collections of data extremely difficult. Previously studied datasets in abusive language detection have been insufficient in size to efficiently train deep learning models. Recently, Hate and Abusive Speech on Twitter, a dataset much greater in size and reliability, has been released. However, this dataset has not been comprehensively studied to its potential. In this paper, we conduct the first comparative study of various learning models on Hate and Abusive Speech on Twitter, and discuss the possibility of using additional features and context data for improvements. Experimental results show that bidirectional GRU networks trained on word-level features, with Latent Topic Clustering modules, is the most accurate model scoring 0.805 F1.

Code Implementations4 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes