CLJun 12, 2024

Label-aware Hard Negative Sampling Strategies with Momentum Contrastive Learning for Implicit Hate Speech Detection

arXiv:2406.07886v127 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses the problem of implicit hate speech detection for social media and content moderation, representing an incremental improvement over prior contrastive learning methods.

The paper tackles the challenge of detecting implicit hate speech by proposing Label-aware Hard Negative sampling strategies (LAHN) with momentum contrastive learning, which outperforms existing models in both in-dataset and cross-dataset evaluations.

Detecting implicit hate speech that is not directly hateful remains a challenge. Recent research has attempted to detect implicit hate speech by applying contrastive learning to pre-trained language models such as BERT and RoBERTa, but the proposed models still do not have a significant advantage over cross-entropy loss-based learning. We found that contrastive learning based on randomly sampled batch data does not encourage the model to learn hard negative samples. In this work, we propose Label-aware Hard Negative sampling strategies (LAHN) that encourage the model to learn detailed features from hard negative samples, instead of naive negative samples in random batch, using momentum-integrated contrastive learning. LAHN outperforms the existing models for implicit hate speech detection both in- and cross-datasets. The code is available at https://github.com/Hanyang-HCC-Lab/LAHN

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes