SIAICLCRCYFeb 2, 2022

Automated Detection of Doxing on Twitter

arXiv:2202.00879v118 citations
Originality Incremental advance
AI Analysis

This addresses the need for automated detection of doxing, a specific form of cyberbullying on Twitter, but is incremental as it builds on existing cyberbullying detection methods.

The paper tackled the problem of automatically detecting doxing on Twitter, which involves disclosing sensitive personal information without consent, and achieved 96.86% accuracy and 97.37% recall using contextualized string embeddings.

Doxing refers to the practice of disclosing sensitive personal information about a person without their consent. This form of cyberbullying is an unpleasant and sometimes dangerous phenomenon for online social networks. Although prior work exists on automated identification of other types of cyberbullying, a need exists for methods capable of detecting doxing on Twitter specifically. We propose and evaluate a set of approaches for automatically detecting second- and third-party disclosures on Twitter of sensitive private information, a subset of which constitutes doxing. We summarize our findings of common intentions behind doxing episodes and compare nine different approaches for automated detection based on string-matching and one-hot encoded heuristics, as well as word and contextualized string embedding representations of tweets. We identify an approach providing 96.86% accuracy and 97.37% recall using contextualized string embeddings and conclude by discussing the practicality of our proposed methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes