LGAICLCVFeb 7, 2024

Examining Modality Incongruity in Multimodal Federated Learning for Medical Vision and Language-based Disease Detection

arXiv:2402.05294v111 citationsh-index: 6
Originality Incremental advance
AI Analysis

This work addresses modality incongruity in federated learning for medical applications, which is an incremental improvement over existing multimodal federated learning methods.

The paper investigates the impact of missing modalities, termed modality incongruity, in multimodal federated learning for medical disease detection, finding that incongruent setups can outperform unimodal federated learning and evaluating methods like self-attention and modality imputation to address this issue.

Multimodal Federated Learning (MMFL) utilizes multiple modalities in each client to build a more powerful Federated Learning (FL) model than its unimodal counterpart. However, the impact of missing modality in different clients, also called modality incongruity, has been greatly overlooked. This paper, for the first time, analyses the impact of modality incongruity and reveals its connection with data heterogeneity across participating clients. We particularly inspect whether incongruent MMFL with unimodal and multimodal clients is more beneficial than unimodal FL. Furthermore, we examine three potential routes of addressing this issue. Firstly, we study the effectiveness of various self-attention mechanisms towards incongruity-agnostic information fusion in MMFL. Secondly, we introduce a modality imputation network (MIN) pre-trained in a multimodal client for modality translation in unimodal clients and investigate its potential towards mitigating the missing modality problem. Thirdly, we assess the capability of client-level and server-level regularization techniques towards mitigating modality incongruity effects. Experiments are conducted under several MMFL settings on two publicly available real-world datasets, MIMIC-CXR and Open-I, with Chest X-Ray and radiology reports.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes