CVAIOct 20, 2020

Cross-Modal Information Maximization for Medical Imaging: CMIM

arXiv:2010.10593v36 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of data silos in hospitals by enhancing AI robustness for medical imaging, though it is incremental as it builds on existing mutual information maximization techniques.

The paper tackles the problem of learning multi-modal representations for medical imaging that remain effective when some modalities are missing at test time, achieving state-of-the-art performance in classification and segmentation tasks with notable improvements for weaker modalities.

In hospitals, data are siloed to specific information systems that make the same information available under different modalities such as the different medical imaging exams the patient undergoes (CT scans, MRI, PET, Ultrasound, etc.) and their associated radiology reports. This offers unique opportunities to obtain and use at train-time those multiple views of the same information that might not always be available at test-time. In this paper, we propose an innovative framework that makes the most of available data by learning good representations of a multi-modal input that are resilient to modality dropping at test-time, using recent advances in mutual information maximization. By maximizing cross-modal information at train time, we are able to outperform several state-of-the-art baselines in two different settings, medical image classification, and segmentation. In particular, our method is shown to have a strong impact on the inference-time performance of weaker modalities.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes