CLOct 25, 2019

DENS: A Dataset for Multi-class Emotion Analysis

arXiv:1910.11769v11006 citations
Originality Synthesis-oriented
AI Analysis

This provides a dataset for researchers in natural language processing to tackle emotion analysis in long-form narratives, though it is incremental as it builds on existing emotion analysis tasks.

The authors introduced DENS, a new dataset for multi-class emotion analysis from long-form English narratives, and found that fine-tuning a pre-trained BERT model achieved the best results with an average micro-F1 score of 60.4%.

We introduce a new dataset for multi-class emotion analysis from long-form narratives in English. The Dataset for Emotions of Narrative Sequences (DENS) was collected from both classic literature available on Project Gutenberg and modern online narratives available on Wattpad, annotated using Amazon Mechanical Turk. A number of statistics and baseline benchmarks are provided for the dataset. Of the tested techniques, we find that the fine-tuning of a pre-trained BERT model achieves the best results, with an average micro-F1 score of 60.4%. Our results show that the dataset provides a novel opportunity in emotion analysis that requires moving beyond existing sentence-level techniques.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes