AILGSep 25, 2024

DRIM: Learning Disentangled Representations from Incomplete Multimodal Healthcare Data

arXiv:2409.17055v28 citationsh-index: 3Has Code
Originality Incremental advance
AI Analysis

This addresses the challenge of integrating incomplete medical data like histopathology and MRI for improved prognosis prediction, though it is incremental as it builds on existing contrastive learning approaches.

The paper tackles the problem of learning from incomplete multimodal healthcare data by introducing DRIM, a method that disentangles shared and unique representations across modalities, achieving state-of-the-art performance on glioma survival prediction tasks with robustness to missing data.

Real-life medical data is often multimodal and incomplete, fueling the growing need for advanced deep learning models capable of integrating them efficiently. The use of diverse modalities, including histopathology slides, MRI, and genetic data, offers unprecedented opportunities to improve prognosis prediction and to unveil new treatment pathways. Contrastive learning, widely used for deriving representations from paired data in multimodal tasks, assumes that different views contain the same task-relevant information and leverages only shared information. This assumption becomes restrictive when handling medical data since each modality also harbors specific knowledge relevant to downstream tasks. We introduce DRIM, a new multimodal method for capturing these shared and unique representations, despite data sparsity. More specifically, given a set of modalities, we aim to encode a representation for each one that can be divided into two components: one encapsulating patient-related information common across modalities and the other, encapsulating modality-specific details. This is achieved by increasing the shared information among different patient modalities while minimizing the overlap between shared and unique components within each modality. Our method outperforms state-of-the-art algorithms on glioma patients survival prediction tasks, while being robust to missing modalities. To promote reproducibility, the code is made publicly available at https://github.com/Lucas-rbnt/DRIM

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes