CVAIJun 22, 2025

Cause-Effect Driven Optimization for Robust Medical Visual Question Answering with Language Biases

arXiv:2506.17903v13 citationsh-index: 18IJCAI
Originality Incremental advance
AI Analysis

This work addresses bias issues in medical AI systems, which is critical for reliable healthcare applications, though it appears incremental as it builds on existing bias mitigation techniques.

The paper tackles language biases in Medical Visual Question Answering by proposing the CEDO framework, which integrates three mechanisms to mitigate biases from causal and effectual perspectives, achieving robust performance over state-of-the-art methods on multiple benchmarks.

Existing Medical Visual Question Answering (Med-VQA) models often suffer from language biases, where spurious correlations between question types and answer categories are inadvertently established. To address these issues, we propose a novel Cause-Effect Driven Optimization framework called CEDO, that incorporates three well-established mechanisms, i.e., Modality-driven Heterogeneous Optimization (MHO), Gradient-guided Modality Synergy (GMS), and Distribution-adapted Loss Rescaling (DLR), for comprehensively mitigating language biases from both causal and effectual perspectives. Specifically, MHO employs adaptive learning rates for specific modalities to achieve heterogeneous optimization, thus enhancing robust reasoning capabilities. Additionally, GMS leverages the Pareto optimization method to foster synergistic interactions between modalities and enforce gradient orthogonality to eliminate bias updates, thereby mitigating language biases from the effect side, i.e., shortcut bias. Furthermore, DLR is designed to assign adaptive weights to individual losses to ensure balanced learning across all answer categories, effectively alleviating language biases from the cause side, i.e., imbalance biases within datasets. Extensive experiments on multiple traditional and bias-sensitive benchmarks consistently demonstrate the robustness of CEDO over state-of-the-art competitors.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes