LG · Mar 26, 2025

Reinforcement Learning for Efficient Toxicity Detection in Competitive Online Video Games

arXiv:2503.20968v1 · 1 citation · h-index: 3
Originality: Incremental advance
AI Analysis

This work addresses the challenge of identifying and mitigating toxic behavior for video game service operators, though it is incremental as it builds on existing bandit methods with domain-specific adaptations.

The paper tackles the problem of efficient sampling for toxicity detection in competitive online video games. It proposes a contextual bandit algorithm that makes monitoring decisions from a small set of variables chosen through domain expertise. On data from Call of Duty: Modern Warfare III, the algorithm consistently outperforms baseline algorithms that rely solely on players' past behavior.

Online platforms take proactive measures to detect and address undesirable behavior, aiming to focus these resource-intensive efforts where such behavior is most prevalent. This article considers the problem of efficient sampling for toxicity detection in competitive online video games. To make optimal monitoring decisions, video game service operators need estimates of the likelihood of toxic behavior. If no model is available for these predictions, one must be estimated in real time. To close this gap, we propose a contextual bandit algorithm that makes monitoring decisions based on a small set of variables that, according to domain expertise, are associated with toxic behavior. This algorithm balances exploration and exploitation to optimize long-term outcomes and is deliberately designed for easy deployment in production. Using data from the popular first-person action game Call of Duty: Modern Warfare III, we show that our algorithm consistently outperforms baseline algorithms that rely solely on players' past behavior. This finding has substantive implications for the nature of toxicity. It also illustrates how domain expertise can be harnessed to help video game service operators identify and mitigate toxicity, ultimately fostering a safer and more enjoyable gaming experience.
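To make the exploration and exploitation trade-off concrete, the following is a minimal sketch of one possible contextual bandit for monitoring decisions. It is not the paper's algorithm, which the abstract does not specify. It assumes per-context Beta-Bernoulli Thompson sampling, and the context variables (`"ranked"`/`"casual"`, `"solo"`/`"party"`), the toxicity rates, and the names `BetaThompsonMonitor` and `simulate` are all hypothetical, chosen only for illustration.

```python
import random
from collections import defaultdict


class BetaThompsonMonitor:
    """Per-context Beta-Bernoulli Thompson sampling (illustrative only)."""

    def __init__(self, seed=0):
        self.rng = random.Random(seed)
        # context -> [alpha, beta], starting from a uniform Beta(1, 1) prior
        self.posterior = defaultdict(lambda: [1.0, 1.0])

    def select(self, contexts, budget):
        """Return indices of the `budget` candidates with the highest sampled rate."""
        draws = []
        for i, ctx in enumerate(contexts):
            a, b = self.posterior[ctx]
            # A random posterior draw drives exploration of uncertain contexts.
            draws.append((self.rng.betavariate(a, b), i))
        draws.sort(reverse=True)
        return [i for _, i in draws[:budget]]

    def update(self, ctx, toxic):
        """Record the outcome of a monitored match for its context."""
        if toxic:
            self.posterior[ctx][0] += 1.0
        else:
            self.posterior[ctx][1] += 1.0

    def mean(self, ctx):
        """Posterior mean toxicity rate for a context."""
        a, b = self.posterior[ctx]
        return a / (a + b)


def simulate(rounds=2000, candidates=10, budget=2, seed=0):
    """Toy environment with made-up toxicity rates per context."""
    rng = random.Random(seed)
    true_rate = {
        ("ranked", "solo"): 0.05,
        ("ranked", "party"): 0.30,
        ("casual", "solo"): 0.02,
        ("casual", "party"): 0.10,
    }
    all_contexts = list(true_rate)
    monitor = BetaThompsonMonitor(seed=seed)
    detected = 0
    for _ in range(rounds):
        # Each round offers a batch of candidate matches; only `budget` can be monitored.
        contexts = [rng.choice(all_contexts) for _ in range(candidates)]
        for i in monitor.select(contexts, budget):
            toxic = rng.random() < true_rate[contexts[i]]
            monitor.update(contexts[i], toxic)
            detected += toxic
    return monitor, detected


if __name__ == "__main__":
    monitor, detected = simulate()
    print("toxic matches detected:", detected)
    for ctx in sorted(monitor.posterior):
        print(ctx, round(monitor.mean(ctx), 3))
```

In this sketch, only monitored matches yield feedback, so the estimates for rarely chosen contexts stay uncertain and occasionally win a posterior draw. That is the mechanism by which a bandit of this kind keeps exploring while concentrating the monitoring budget where toxicity appears most likely.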
