CLAILGFeb 18, 2025

A Fuzzy Evaluation of Sentence Encoders on Grooming Risk Classification

arXiv:2502.12576v11 citationsh-index: 7
Originality Synthesis-oriented
AI Analysis

This addresses the challenge of protecting children from online grooming by law enforcement and victims, but it is incremental as it evaluates existing models rather than proposing a new solution.

The paper tackled the problem of detecting grooming risk in online chat conversations, where predators use indirect and coded language to evade detection, and found that fine-tuned models fail to tag such instances, often misclassifying due to a higher presence of out-of-vocabulary words.

With the advent of social media, children are becoming increasingly vulnerable to the risk of grooming in online settings. Detecting grooming instances in an online conversation poses a significant challenge as the interactions are not necessarily sexually explicit, since the predators take time to build trust and a relationship with their victim. Moreover, predators evade detection using indirect and coded language. While previous studies have fine-tuned Transformers to automatically identify grooming in chat conversations, they overlook the impact of coded and indirect language on model predictions, and how these align with human perceptions of grooming. In this paper, we address this gap and evaluate bi-encoders on the task of classifying different degrees of grooming risk in chat contexts, for three different participant groups, i.e. law enforcement officers, real victims, and decoys. Using a fuzzy-theoretic framework, we map human assessments of grooming behaviors to estimate the actual degree of grooming risk. Our analysis reveals that fine-tuned models fail to tag instances where the predator uses indirect speech pathways and coded language to evade detection. Further, we find that such instances are characterized by a higher presence of out-of-vocabulary (OOV) words in samples, causing the model to misclassify. Our findings highlight the need for more robust models to identify coded language from noisy chat inputs in grooming contexts.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes