CL · Sep 11, 2018

Multilingual Cross-domain Perspectives on Online Hate Speech

arXiv:1809.03944v1 · 17 citations
Originality: Synthesis-oriented
AI Analysis

This work provides insights into hate speech patterns for researchers and policymakers, but it is incremental as it applies existing methods to new data.

The study analyzed eight multilingual online hate speech corpora to identify shared characteristics across jihadist, extremist, racist, and sexist content, using NLP techniques such as text classification and keyword extraction.

In this report, we present a study of eight corpora of online hate speech and demonstrate the NLP techniques that we used to collect and analyze the jihadist, extremist, racist, and sexist content. Analysis of the multilingual corpora shows that the different contexts share certain characteristics in their hateful rhetoric. To expose the main features, we focused on text classification, text profiling, and keyword and collocation extraction, along with manual annotation and qualitative study.
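To make "collocation extraction" concrete, here is a minimal sketch that ranks adjacent word pairs by pointwise mutual information (PMI). It is an illustration only: the report does not state which association measure or tooling the authors used, so the PMI scoring, the function name `bigram_pmi`, the `min_count` threshold, and the neutral toy text are all assumptions.

```python
import math
from collections import Counter


def bigram_pmi(tokens, min_count=2):
    """Rank adjacent word pairs by pointwise mutual information (PMI).

    Hypothetical helper for illustration; not the authors' method.
    """
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_uni = len(tokens)
    n_bi = max(len(tokens) - 1, 1)

    scores = {}
    for (w1, w2), count in bigrams.items():
        # Skip rare pairs: PMI is unreliable for low counts.
        if count < min_count:
            continue
        p_pair = count / n_bi
        p_w1 = unigrams[w1] / n_uni
        p_w2 = unigrams[w2] / n_uni
        # PMI > 0 means the pair co-occurs more often than chance predicts.
        scores[(w1, w2)] = math.log2(p_pair / (p_w1 * p_w2))

    # Highest-scoring pairs first.
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Neutral toy text (an assumption, not data from the corpora).
text = "new york is big . new york is busy . the city is big"
collocations = bigram_pmi(text.split())
for pair, score in collocations:
    print(pair, round(score, 2))
```

On a real corpus, tokenization, stop-word handling, and the frequency threshold would all need tuning per language, which matters for multilingual data like that described above.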
