CLJul 20, 2023

General Debiasing for Multimodal Sentiment Analysis

Teng Sun, Juntong Ni, Wenjie Wang, Liqiang Jing, Yinwei Wei, Liqiang Nie

arXiv:2307.10511v24.328 citationsh-index: 76Has Code

Originality Incremental advance

AI Analysis

This addresses a critical issue for researchers and practitioners in multimodal AI by improving model robustness, though it is incremental as it builds on existing debiasing techniques.

The paper tackles the problem of spurious correlations in multimodal sentiment analysis, which can lead to poor out-of-distribution generalization, by proposing a debiasing framework based on inverse probability weighting that reduces reliance on biased features, resulting in superior generalization ability on multiple OOD testing sets.

Existing work on Multimodal Sentiment Analysis (MSA) utilizes multimodal information for prediction yet unavoidably suffers from fitting the spurious correlations between multimodal features and sentiment labels. For example, if most videos with a blue background have positive labels in a dataset, the model will rely on such correlations for prediction, while "blue background" is not a sentiment-related feature. To address this problem, we define a general debiasing MSA task, which aims to enhance the Out-Of-Distribution (OOD) generalization ability of MSA models by reducing their reliance on spurious correlations. To this end, we propose a general debiasing framework based on Inverse Probability Weighting (IPW), which adaptively assigns small weights to the samples with larger bias (i.e., the severer spurious correlations). The key to this debiasing framework is to estimate the bias of each sample, which is achieved by two steps: 1) disentangling the robust features and biased features in each modality, and 2) utilizing the biased features to estimate the bias. Finally, we employ IPW to reduce the effects of large-biased samples, facilitating robust feature learning for sentiment prediction. To examine the model's generalization ability, we keep the original testing sets on two benchmarks and additionally construct multiple unimodal and multimodal OOD testing sets. The empirical results demonstrate the superior generalization ability of our proposed framework. We have released the code and data to facilitate the reproduction https://github.com/Teng-Sun/GEAR.

View on arXiv PDF Code

Similar