IVCVLGJul 14, 2022

MedFuse: Multi-modal fusion with clinical time-series data and chest X-ray images

arXiv:2207.07027v287 citationsh-index: 30
Originality Incremental advance
AI Analysis

This addresses the challenge of integrating clinical time-series and chest X-ray data for healthcare tasks, offering a robust solution for partially paired datasets, though it is incremental as it builds on existing fusion methods.

The paper tackled the problem of multi-modal fusion in healthcare, where data is often asynchronously collected, by proposing MedFuse, an LSTM-based module that handles both uni-modal and multi-modal inputs, resulting in a large performance improvement on in-hospital mortality prediction and phenotype classification benchmarks compared to more complex strategies.

Multi-modal fusion approaches aim to integrate information from different data sources. Unlike natural datasets, such as in audio-visual applications, where samples consist of "paired" modalities, data in healthcare is often collected asynchronously. Hence, requiring the presence of all modalities for a given sample is not realistic for clinical tasks and significantly limits the size of the dataset during training. In this paper, we propose MedFuse, a conceptually simple yet promising LSTM-based fusion module that can accommodate uni-modal as well as multi-modal input. We evaluate the fusion method and introduce new benchmark results for in-hospital mortality prediction and phenotype classification, using clinical time-series data in the MIMIC-IV dataset and corresponding chest X-ray images in MIMIC-CXR. Compared to more complex multi-modal fusion strategies, MedFuse provides a performance improvement by a large margin on the fully paired test set. It also remains robust across the partially paired test set containing samples with missing chest X-ray images. We release our code for reproducibility and to enable the evaluation of competing models in the future.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes