CLSep 30, 2021

Multi-Modal Sarcasm Detection Based on Contrastive Attention Mechanism

Xiaoqiang Zhang, Ying Chen, Guangyuan Li

arXiv:2109.15153v11.427 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses sarcasm detection for video communication applications, representing an incremental improvement in a growing research area.

The paper tackles multi-modal sarcasm detection in video conversations by proposing a ConAttSD model that uses an inter-modality contrastive attention mechanism to extract incongruity features between modalities, and it demonstrates effectiveness on the MUStARD benchmark dataset.

In the past decade, sarcasm detection has been intensively conducted in a textual scenario. With the popularization of video communication, the analysis in multi-modal scenarios has received much attention in recent years. Therefore, multi-modal sarcasm detection, which aims at detecting sarcasm in video conversations, becomes increasingly hot in both the natural language processing community and the multi-modal analysis community. In this paper, considering that sarcasm is often conveyed through incongruity between modalities (e.g., text expressing a compliment while acoustic tone indicating a grumble), we construct a Contras-tive-Attention-based Sarcasm Detection (ConAttSD) model, which uses an inter-modality contrastive attention mechanism to extract several contrastive features for an utterance. A contrastive feature represents the incongruity of information between two modalities. Our experiments on MUStARD, a benchmark multi-modal sarcasm dataset, demonstrate the effectiveness of the proposed ConAttSD model.

View on arXiv PDF Code

Similar