CLSISep 11, 2021

Latent Hatred: A Benchmark for Understanding Implicit Hate Speech

arXiv:2109.05322v1704 citations
Originality Synthesis-oriented
AI Analysis

This addresses the pervasive issue of coded hate speech for social media platforms and researchers, providing a tool for better detection and understanding.

The paper tackles the problem of implicit hate speech on social media by introducing a new benchmark dataset with a taxonomy and fine-grained labels, and it shows that contemporary models struggle with this form of speech, highlighting key challenging features.

Hate speech has grown significantly on social media, causing serious consequences for victims of all demographics. Despite much attention being paid to characterize and detect discriminatory speech, most work has focused on explicit or overt hate speech, failing to address a more pervasive form based on coded or indirect language. To fill this gap, this work introduces a theoretically-justified taxonomy of implicit hate speech and a benchmark corpus with fine-grained labels for each message and its implication. We present systematic analyses of our dataset using contemporary baselines to detect and explain implicit hate speech, and we discuss key features that challenge existing models. This dataset will continue to serve as a useful benchmark for understanding this multifaceted issue.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes