LGDec 20, 2024

Measuring Cross-Modal Interactions in Multimodal Models

arXiv:2412.15828v215 citationsh-index: 26Has CodeAAAI
Originality Highly original
AI Analysis

This addresses the need for explainable AI in healthcare to improve clinical adoption by handling multiple data sources and offering personalized insights, representing a novel method for a known bottleneck.

The paper tackles the problem of quantifying cross-modal interactions in multimodal AI models, particularly for healthcare applications, by introducing InterSHAP, a method that accurately measures these interactions and provides individualized explanations without relying on approximations or labeled data.

Integrating AI in healthcare can greatly improve patient care and system efficiency. However, the lack of explainability in AI systems (XAI) hinders their clinical adoption, especially in multimodal settings that use increasingly complex model architectures. Most existing XAI methods focus on unimodal models, which fail to capture cross-modal interactions crucial for understanding the combined impact of multiple data sources. Existing methods for quantifying cross-modal interactions are limited to two modalities, rely on labelled data, and depend on model performance. This is problematic in healthcare, where XAI must handle multiple data sources and provide individualised explanations. This paper introduces InterSHAP, a cross-modal interaction score that addresses the limitations of existing approaches. InterSHAP uses the Shapley interaction index to precisely separate and quantify the contributions of the individual modalities and their interactions without approximations. By integrating an open-source implementation with the SHAP package, we enhance reproducibility and ease of use. We show that InterSHAP accurately measures the presence of cross-modal interactions, can handle multiple modalities, and provides detailed explanations at a local level for individual samples. Furthermore, we apply InterSHAP to multimodal medical datasets and demonstrate its applicability for individualised explanations.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes