CL LGFeb 27, 2019

A Machine Learning Approach to Comment Toxicity Classification

arXiv:1903.06765v10.745 citations

Originality Synthesis-oriented

AI Analysis

This addresses the need for automated systems to detect and prevent toxic behavior in online environments like social media, though it appears incremental as it applies an existing tf-idf method to a known dataset.

The study tackled the problem of identifying toxic comments online, such as obscenity and threats, using a machine learning model on a Wikipedia dataset, achieving a mean validation accuracy of 98.08% and absolute validation accuracy of 91.61%.

Now-a-days, derogatory comments are often made by one another, not only in offline environment but also immensely in online environments like social networking websites and online communities. So, an Identification combined with Prevention System in all social networking websites and applications, including all the communities, existing in the digital world is a necessity. In such a system, the Identification Block should identify any negative online behaviour and should signal the Prevention Block to take action accordingly. This study aims to analyse any piece of text and detecting different types of toxicity like obscenity, threats, insults and identity-based hatred. The labelled Wikipedia Comment Dataset prepared by Jigsaw is used for the purpose. A 6-headed Machine Learning tf-idf Model has been made and trained separately, yielding a Mean Validation Accuracy of 98.08% and Absolute Validation Accuracy of 91.61%. Such an Automated System should be deployed for enhancing healthy online conversation

View on arXiv PDF

Similar