CLMay 20, 2021

Multi-modal Sarcasm Detection and Humor Classification in Code-mixed Conversations

arXiv:2105.09984v298 citations
Originality Incremental advance
AI Analysis

This work addresses a gap in multi-modal analysis for non-English languages, specifically Hindi-English code-mixed conversations, by providing a new dataset and model, though it is incremental in applying attention mechanisms to this domain.

The authors tackled sarcasm detection and humor classification in Hindi-English code-mixed conversations by creating the first multi-modal dataset (MaSaC) and proposing a novel neural architecture (MSH-COMICS), which achieved improvements of >1 F1-score point for sarcasm detection and 10 F1-score points for humor classification over existing models.

Sarcasm detection and humor classification are inherently subtle problems, primarily due to their dependence on the contextual and non-verbal information. Furthermore, existing studies in these two topics are usually constrained in non-English languages such as Hindi, due to the unavailability of qualitative annotated datasets. In this work, we make two major contributions considering the above limitations: (1) we develop a Hindi-English code-mixed dataset, MaSaC, for the multi-modal sarcasm detection and humor classification in conversational dialog, which to our knowledge is the first dataset of its kind; (2) we propose MSH-COMICS, a novel attention-rich neural architecture for the utterance classification. We learn efficient utterance representation utilizing a hierarchical attention mechanism that attends to a small portion of the input sentence at a time. Further, we incorporate dialog-level contextual attention mechanism to leverage the dialog history for the multi-modal classification. We perform extensive experiments for both the tasks by varying multi-modal inputs and various submodules of MSH-COMICS. We also conduct comparative analysis against existing approaches. We observe that MSH-COMICS attains superior performance over the existing models by > 1 F1-score point for the sarcasm detection and 10 F1-score points in humor classification. We diagnose our model and perform thorough analysis of the results to understand the superiority and pitfalls.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes