CL CV MMMar 31, 2024

MIPS at SemEval-2024 Task 3: Multimodal Emotion-Cause Pair Extraction in Conversations with Multimodal Language Models

Zebang Cheng, Fuqiang Niu, Yuxiang Lin, Zhi-Qi Cheng, Bowen Zhang, Xiaojiang Peng

CMUUW

arXiv:2404.00511v316.229 citationsh-index: 26Has CodeSemEval

Originality Incremental advance

AI Analysis

This addresses emotion analysis in conversations for NLP and multimodal AI, but it is incremental as it builds on existing methods with specific improvements.

The paper tackled multimodal emotion-cause pair extraction in conversations by proposing a MER-MCE framework that integrates text, audio, and visual modalities, achieving a weighted F1 score of 0.3435 and ranking third in the competition.

This paper presents our winning submission to Subtask 2 of SemEval 2024 Task 3 on multimodal emotion cause analysis in conversations. We propose a novel Multimodal Emotion Recognition and Multimodal Emotion Cause Extraction (MER-MCE) framework that integrates text, audio, and visual modalities using specialized emotion encoders. Our approach sets itself apart from top-performing teams by leveraging modality-specific features for enhanced emotion understanding and causality inference. Experimental evaluation demonstrates the advantages of our multimodal approach, with our submission achieving a competitive weighted F1 score of 0.3435, ranking third with a margin of only 0.0339 behind the 1st team and 0.0025 behind the 2nd team. Project: https://github.com/MIPS-COLT/MER-MCE.git

View on arXiv PDF Code

Similar