CLDec 23, 2022

Content Rating Classification for Fan Fiction

arXiv:2212.12496v11 citationsh-index: 7

Originality Synthesis-oriented

AI Analysis

This addresses the need for automated content rating in fan fiction, though it is incremental as it adapts existing NLP methods to a new domain with limited success.

The paper tackled the problem of automatically classifying content ratings for fan fiction text, finding that multi-class classification methods produced poor accuracy, but binary classification improved results.

Content ratings can enable audiences to determine the suitability of various media products. With the recent advent of fan fiction, the critical issue of fan fiction content ratings has emerged. Whether fan fiction content ratings are done voluntarily or required by regulation, there is the need to automate the content rating classification. The problem is to take fan fiction text and determine the appropriate content rating. Methods for other domains, such as online books, have been attempted though none have been applied to fan fiction. We propose natural language processing techniques, including traditional and deep learning methods, to automatically determine the content rating. We show that these methods produce poor accuracy results for multi-classification. We then demonstrate that treating the problem as a binary classification problem produces better accuracy. Finally, we believe and provide some evidence that the current approach of self-annotating has led to incorrect labels limiting classification results.

View on arXiv PDF

Similar