CLAIOct 20, 2025

Efficient Toxicity Detection in Gaming Chats: A Comparative Study of Embeddings, Fine-Tuned Transformers and LLMs

arXiv:2510.17924v11 citationsh-index: 2Journal of Data Mining & Digital Humanities
Originality Synthesis-oriented
AI Analysis

It addresses the need for cost-effective content moderation in dynamic online gaming environments, though it is incremental as it compares existing methods.

This paper tackled the problem of automated toxicity detection in online gaming chats by comparing various NLP methods, finding that fine-tuned DistilBERT achieved optimal accuracy-cost trade-offs.

This paper presents a comprehensive comparative analysis of Natural Language Processing (NLP) methods for automated toxicity detection in online gaming chats. Traditional machine learning models with embeddings, large language models (LLMs) with zero-shot and few-shot prompting, fine-tuned transformer models, and retrieval-augmented generation (RAG) approaches are evaluated. The evaluation framework assesses three critical dimensions: classification accuracy, processing speed, and computational costs. A hybrid moderation system architecture is proposed that optimizes human moderator workload through automated detection and incorporates continuous learning mechanisms. The experimental results demonstrate significant performance variations across methods, with fine-tuned DistilBERT achieving optimal accuracy-cost trade-offs. The findings provide empirical evidence for deploying cost-effective, efficient content moderation systems in dynamic online gaming environments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes