CLJan 26, 2025

STATE ToxiCN: A Benchmark for Span-level Target-Aware Toxicity Extraction in Chinese Hate Speech Detection

arXiv:2501.15451v39 citationsh-index: 15ACL
Originality Synthesis-oriented
AI Analysis

This addresses the lack of fine-grained resources for Chinese hate speech detection, which is important for researchers and practitioners in NLP and content moderation, though it is incremental in providing new data and evaluations.

The authors tackled the problem of fine-grained hate speech detection in Chinese by creating STATE ToxiCN, the first span-level Chinese hate speech dataset with Target-Argument-Hateful-Group quadruples, and used it to evaluate existing models and study Chinese hateful slang, including LLM detection capabilities.

The proliferation of hate speech has caused significant harm to society. The intensity and directionality of hate are closely tied to the target and argument it is associated with. However, research on hate speech detection in Chinese has lagged behind, and existing datasets lack span-level fine-grained annotations. Furthermore, the lack of research on Chinese hateful slang poses a significant challenge. In this paper, we provide a solution for fine-grained detection of Chinese hate speech. First, we construct a dataset containing Target-Argument-Hateful-Group quadruples (STATE ToxiCN), which is the first span-level Chinese hate speech dataset. Secondly, we evaluate the span-level hate speech detection performance of existing models using STATE ToxiCN. Finally, we conduct the first study on Chinese hateful slang and evaluate the ability of LLMs to detect such expressions. Our work contributes valuable resources and insights to advance span-level hate speech detection in Chinese.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes