DENS: A Dataset for Multi-class Emotion Analysis
This provides a dataset for researchers in natural language processing to tackle emotion analysis in long-form narratives, though it is incremental as it builds on existing emotion analysis tasks.
The authors introduced DENS, a new dataset for multi-class emotion analysis from long-form English narratives, and found that fine-tuning a pre-trained BERT model achieved the best results with an average micro-F1 score of 60.4%.
We introduce a new dataset for multi-class emotion analysis from long-form narratives in English. The Dataset for Emotions of Narrative Sequences (DENS) was collected from both classic literature available on Project Gutenberg and modern online narratives available on Wattpad, annotated using Amazon Mechanical Turk. A number of statistics and baseline benchmarks are provided for the dataset. Of the tested techniques, we find that the fine-tuning of a pre-trained BERT model achieves the best results, with an average micro-F1 score of 60.4%. Our results show that the dataset provides a novel opportunity in emotion analysis that requires moving beyond existing sentence-level techniques.