CLOct 5, 2018

MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations

arXiv:1810.02508v61578 citations
Originality Synthesis-oriented
AI Analysis

This provides a dataset for researchers working on emotion recognition in conversations, but it is incremental as it extends an existing dataset.

The authors tackled the lack of a large-scale multimodal multi-party emotional conversational dataset by creating MELD, which includes about 13,000 utterances from 1,433 dialogues with emotion and sentiment labels across audio, visual, and textual modalities, and demonstrated the importance of contextual and multimodal information for emotion recognition.

Emotion recognition in conversations is a challenging task that has recently gained popularity due to its potential applications. Until now, however, a large-scale multimodal multi-party emotional conversational database containing more than two speakers per dialogue was missing. Thus, we propose the Multimodal EmotionLines Dataset (MELD), an extension and enhancement of EmotionLines. MELD contains about 13,000 utterances from 1,433 dialogues from the TV-series Friends. Each utterance is annotated with emotion and sentiment labels, and encompasses audio, visual and textual modalities. We propose several strong multimodal baselines and show the importance of contextual and multimodal information for emotion recognition in conversations. The full dataset is available for use at http:// affective-meld.github.io.

Code Implementations8 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes