CL LG SIJul 31, 2023

Noisy Self-Training with Data Augmentations for Offensive and Hate Speech Detection Tasks

João A. Leite, Carolina Scarton, Diego F. Silva

arXiv:2307.16609v121.2134 citationsh-index: 12Has Code

Originality Synthesis-oriented

AI Analysis

This work addresses the challenge of costly labeled data for content moderation, but it is incremental as it builds on existing self-training and augmentation techniques.

The paper tackled the problem of detecting offensive and hate speech in social media by using self-training methods with data augmentations to leverage unlabeled data, finding that default self-training improved performance by up to +1.5% F1-macro, but noisy self-training with augmentations decreased performance.

Online social media is rife with offensive and hateful comments, prompting the need for their automatic detection given the sheer amount of posts created every second. Creating high-quality human-labelled datasets for this task is difficult and costly, especially because non-offensive posts are significantly more frequent than offensive ones. However, unlabelled data is abundant, easier, and cheaper to obtain. In this scenario, self-training methods, using weakly-labelled examples to increase the amount of training data, can be employed. Recent "noisy" self-training approaches incorporate data augmentation techniques to ensure prediction consistency and increase robustness against noisy data and adversarial attacks. In this paper, we experiment with default and noisy self-training using three different textual data augmentation techniques across five different pre-trained BERT architectures varying in size. We evaluate our experiments on two offensive/hate-speech datasets and demonstrate that (i) self-training consistently improves performance regardless of model size, resulting in up to +1.5% F1-macro on both datasets, and (ii) noisy self-training with textual data augmentations, despite being successfully applied in similar settings, decreases performance on offensive and hate-speech domains when compared to the default method, even with state-of-the-art augmentations such as backtranslation.

View on arXiv PDF Code

Similar