CLAILGMay 5, 2022

COGMEN: COntextualized GNN based Multimodal Emotion recognitioN

arXiv:2205.02455v1635 citationsh-index: 24
Originality Incremental advance
AI Analysis

This addresses emotion recognition for AI systems in conversational settings, representing an incremental improvement with specific gains.

The paper tackles multimodal emotion recognition in conversations by proposing COGMEN, a system that models local and global dependencies using Graph Neural Networks, achieving state-of-the-art results on IEMOCAP and MOSEI datasets.

Emotions are an inherent part of human interactions, and consequently, it is imperative to develop AI systems that understand and recognize human emotions. During a conversation involving various people, a person's emotions are influenced by the other speaker's utterances and their own emotional state over the utterances. In this paper, we propose COntextualized Graph Neural Network based Multimodal Emotion recognitioN (COGMEN) system that leverages local information (i.e., inter/intra dependency between speakers) and global information (context). The proposed model uses Graph Neural Network (GNN) based architecture to model the complex dependencies (local and global information) in a conversation. Our model gives state-of-the-art (SOTA) results on IEMOCAP and MOSEI datasets, and detailed ablation experiments show the importance of modeling information at both levels.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes