CYCRSIMay 21, 2018

"You Know What to Do": Proactive Detection of YouTube Videos Targeted by Coordinated Hate Attacks

arXiv:1805.08168v376 citations
Originality Incremental advance
AI Analysis

This addresses the issue of online harassment on platforms like YouTube, where current reactive measures are insufficient, offering a proactive detection system.

The paper tackles the problem of detecting YouTube videos likely to be targeted by coordinated hate attacks, such as raids from communities like 4chan, by modeling video features and using an ensemble of classifiers, achieving an AUC of up to 94%.

Video sharing platforms like YouTube are increasingly targeted by aggression and hate attacks. Prior work has shown how these attacks often take place as a result of "raids," i.e., organized efforts by ad-hoc mobs coordinating from third-party communities. Despite the increasing relevance of this phenomenon, however, online services often lack effective countermeasures to mitigate it. Unlike well-studied problems like spam and phishing, coordinated aggressive behavior both targets and is perpetrated by humans, making defense mechanisms that look for automated activity unsuitable. Therefore, the de-facto solution is to reactively rely on user reports and human moderation. In this paper, we propose an automated solution to identify YouTube videos that are likely to be targeted by coordinated harassers from fringe communities like 4chan. First, we characterize and model YouTube videos along several axes (metadata, audio transcripts, thumbnails) based on a ground truth dataset of videos that were targeted by raids. Then, we use an ensemble of classifiers to determine the likelihood that a video will be raided with very good results (AUC up to 94%). Overall, our work provides an important first step towards deploying proactive systems to detect and mitigate coordinated hate attacks on platforms like YouTube.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes