LGCLDec 10, 2024

Bridging the Gap for Test-Time Multimodal Sentiment Analysis

arXiv:2412.07121v22 citationsh-index: 7Has Code
AI Analysis

This addresses a privacy and storage issue in adapting models for real-world dynamic scenarios, though it is incremental as it builds on existing test-time adaptation methods.

The paper tackles performance degradation in multimodal sentiment analysis due to distribution shifts between source and target data by proposing CASP, a test-time adaptation method that uses contrastive adaptation and stable pseudo-label generation, achieving significant improvements across various settings and backbones.

Multimodal sentiment analysis (MSA) is an emerging research topic that aims to understand and recognize human sentiment or emotions through multiple modalities. However, in real-world dynamic scenarios, the distribution of target data is always changing and different from the source data used to train the model, which leads to performance degradation. Common adaptation methods usually need source data, which could pose privacy issues or storage overheads. Therefore, test-time adaptation (TTA) methods are introduced to improve the performance of the model at inference time. Existing TTA methods are always based on probabilistic models and unimodal learning, and thus can not be applied to MSA which is often considered as a multimodal regression task. In this paper, we propose two strategies: Contrastive Adaptation and Stable Pseudo-label generation (CASP) for test-time adaptation for multimodal sentiment analysis. The two strategies deal with the distribution shifts for MSA by enforcing consistency and minimizing empirical risk, respectively. Extensive experiments show that CASP brings significant and consistent improvements to the performance of the model across various distribution shift settings and with different backbones, demonstrating its effectiveness and versatility. Our codes are available at https://github.com/zrguo/CASP.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes