CL SISep 11, 2021

Latent Hatred: A Benchmark for Understanding Implicit Hate Speech

Mai ElSherief, Caleb Ziems, David Muchlinski, Vaishnavi Anupindi, Jordyn Seybolt, Munmun De Choudhury, Diyi Yang

arXiv:2109.05322v132.8704 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This addresses the pervasive issue of coded hate speech for social media platforms and researchers, providing a tool for better detection and understanding.

The paper tackles the problem of implicit hate speech on social media by introducing a new benchmark dataset with a taxonomy and fine-grained labels, and it shows that contemporary models struggle with this form of speech, highlighting key challenging features.

Hate speech has grown significantly on social media, causing serious consequences for victims of all demographics. Despite much attention being paid to characterize and detect discriminatory speech, most work has focused on explicit or overt hate speech, failing to address a more pervasive form based on coded or indirect language. To fill this gap, this work introduces a theoretically-justified taxonomy of implicit hate speech and a benchmark corpus with fine-grained labels for each message and its implication. We present systematic analyses of our dataset using contemporary baselines to detect and explain implicit hate speech, and we discuss key features that challenge existing models. This dataset will continue to serve as a useful benchmark for understanding this multifaceted issue.

View on arXiv PDF Code

Similar