CLLGMay 19, 2021

Methods for Detoxification of Texts for the Russian Language

arXiv:2105.09052v128 citations
Originality Synthesis-oriented
AI Analysis

This addresses the problem of processing toxic content in social media for Russian language users, but it is incremental as it adapts existing methods to a new language.

The study tackled automatic detoxification of Russian texts to combat offensive language, testing unsupervised BERT-based and supervised GPT-2-based models, and found they can be successfully used for detoxification with room for improvement.

We introduce the first study of automatic detoxification of Russian texts to combat offensive language. Such a kind of textual style transfer can be used, for instance, for processing toxic content in social media. While much work has been done for the English language in this field, it has never been solved for the Russian language yet. We test two types of models - unsupervised approach based on BERT architecture that performs local corrections and supervised approach based on pretrained language GPT-2 model - and compare them with several baselines. In addition, we describe evaluation setup providing training datasets and metrics for automatic evaluation. The results show that the tested approaches can be successfully used for detoxification, although there is room for improvement.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes