CL · Feb 11, 2025

Target-Augmented Shared Fusion-based Multimodal Sarcasm Explanation Generation

arXiv:2502.07391v1 · 12 citations · h-index: 14 · Has Code · NAACL
Originality: Incremental advance
AI Analysis

This work addresses the challenge of interpreting sarcasm in social media content for applications like sentiment analysis, though it is incremental in improving existing multimodal methods.

The paper tackles the problem of generating natural language explanations for sarcasm in multimodal posts. It proposes TURBO, a model that incorporates the target of sarcasm and uses a shared-fusion mechanism, and reports an average performance improvement of +3.3% over baselines on the MORE+ dataset.

Sarcasm is a linguistic phenomenon that intends to ridicule a target (e.g., entity, event, or person) in an inherent way. Multimodal Sarcasm Explanation (MuSE) aims at revealing the intended irony in a sarcastic post using a natural language explanation. Existing systems, however, overlook the significance of the target of sarcasm in generating explanations. In this paper, we propose a Target-aUgmented shaRed fusion-Based sarcasm explanatiOn model, a.k.a. TURBO. We design a novel shared-fusion mechanism to leverage the inter-modality relationships between an image and its caption. TURBO assumes the target of the sarcasm and guides the multimodal shared-fusion mechanism in learning the intricacies of the intended irony for explanations. We evaluate our proposed TURBO model on the MORE+ dataset. Comparison against multiple baselines and state-of-the-art models shows that TURBO improves performance by an average margin of $+3.3\%$. Moreover, we explore LLMs in zero-shot and one-shot settings for our task and observe that LLM-generated explanations, though remarkable, often fail to capture the critical nuances of the sarcasm. Furthermore, we supplement our study with an extensive human evaluation of TURBO's generated explanations and find them to be comparatively better than those of other systems.

Code Implementations: 1 repo
