CL · Mar 19, 2019

Bidirectional Recurrent Models for Offensive Tweet Classification

arXiv:1903.08808v1 · 2 citations
Originality: Synthesis-oriented
AI Analysis

This work addresses the detection and classification of offensive content in tweets; its contribution is an incremental improvement for social media moderation.

The paper tackles offensive tweet detection and classification with four deep recurrent architectures. It finds that model complexity did not necessarily improve results: the best-performing model was a simple bidirectional LSTM, which achieved competitive F1-scores on the validation and test sets.

In this paper we propose four deep recurrent architectures to tackle the task of offensive tweet detection, as well as further classification by targeting and by the subject of said targeting. Our architectures are based on LSTMs and GRUs. We present a simple bidirectional LSTM as a baseline system and then increase the complexity of the models by adding convolutional layers and by implementing a split-process-merge architecture with LSTM and GRU as processors. Multiple pre-processing techniques were also investigated. The validation F1-score of each model is presented for the three subtasks, along with the final F1-score on the private competition test set. We found that model complexity did not necessarily yield better results. Our best-performing model was also the simplest, a bidirectional LSTM, closely followed by a two-branch bidirectional LSTM and GRU architecture.
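
The abstract reports every result as an F1-score. As a rough illustration of how such a score can be computed, here is a minimal pure-Python sketch. It assumes macro-averaging over classes, which the abstract does not state. The function names and the toy labels `OFF` and `NOT` are hypothetical and are not taken from the paper.

```python
from collections import Counter


def f1_per_class(y_true, y_pred):
    """Return {label: F1}, scoring each label one-vs-rest."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1   # correct prediction for this label
        else:
            fp[pred] += 1    # predicted label was wrong
            fn[truth] += 1   # true label was missed
    scores = {}
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when the denominator is 0
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores[label] = 2 * tp[label] / denom if denom else 0.0
    return scores


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 (assumed averaging scheme)."""
    scores = f1_per_class(y_true, y_pred)
    return sum(scores.values()) / len(scores) if scores else 0.0


# Hypothetical toy data, not results from the paper.
y_true = ["OFF", "NOT", "NOT", "OFF", "NOT", "NOT"]
y_pred = ["OFF", "NOT", "OFF", "NOT", "NOT", "NOT"]

per_class = f1_per_class(y_true, y_pred)
overall = macro_f1(y_true, y_pred)
```

On this toy data the per-class scores are 0.5 for `OFF` and 0.75 for `NOT`, so the macro average is 0.625. Macro-averaging weights both classes equally, which matters when offensive tweets are the minority class.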

Code Implementations: 2 repos
