LG CL CYMar 4, 2025

AI Enabled User-Specific Cyberbullying Severity Detection with Explainability

Tabia Tanzin Prama, Jannatul Ferdaws Amrin, Md. Mushfique Anwar, Iqbal H. Sarker

arXiv:2503.10650v24.13 citationsh-index: 10

Originality Incremental advance

AI Analysis

This work addresses cyberbullying mitigation for social media users by providing a more personalized detection system, though it is incremental as it builds on existing ML models with added user factors.

The study tackled cyberbullying detection by integrating user-specific attributes like psychological factors and demographics with social media comments, achieving 98% accuracy and an F1-score of 0.97 in severity classification. It used explainable AI to identify key risk factors, such as racial and gender targeting linked to depression and low self-esteem.

The rise of social media has significantly increased the prevalence of cyberbullying (CB), posing serious risks to both mental and physical well-being. Effective detection systems are essential for mitigating its impact. While several machine learning (ML) models have been developed, few incorporate victims' psychological, demographic, and behavioral factors alongside bullying comments to assess severity. In this study, we propose an AI model intregrating user-specific attributes, including psychological factors (self-esteem, anxiety, depression), online behavior (internet usage, disciplinary history), and demographic attributes (race, gender, ethnicity), along with social media comments. Additionally, we introduce a re-labeling technique that categorizes social media comments into three severity levels: Not Bullying, Mild Bullying, and Severe Bullying, considering user-specific factors.Our LSTM model is trained using 146 features, incorporating emotional, topical, and word2vec representations of social media comments as well as user-level attributes and it outperforms existing baseline models, achieving the highest accuracy of 98\% and an F1-score of 0.97. To identify key factors influencing the severity of cyberbullying, we employ explainable AI techniques (SHAP and LIME) to interpret the model's decision-making process. Our findings reveal that, beyond hate comments, victims belonging to specific racial and gender groups are more frequently targeted and exhibit higher incidences of depression, disciplinary issues, and low self-esteem. Additionally, individuals with a prior history of bullying are at a greater risk of becoming victims of cyberbullying.

View on arXiv PDF

Similar