CLJul 17, 2024

MEDFuse: Multimodal EHR Data Fusion with Masked Lab-Test Modeling and Large Language Models

arXiv:2407.12309v110 citationsh-index: 8
Originality Incremental advance
AI Analysis

This addresses the challenge of effectively combining multimodal EHR data for clinical predictions, which is incremental as it builds on existing methods like transformers and LLMs.

The paper tackled the problem of multimodal EHR data fusion by proposing MEDFuse, a framework that integrates structured lab tests and unstructured clinical notes using masked lab-test modeling and LLMs, achieving over 90% F1 score in a 10-disease multi-label classification task.

Electronic health records (EHRs) are multimodal by nature, consisting of structured tabular features like lab tests and unstructured clinical notes. In real-life clinical practice, doctors use complementary multimodal EHR data sources to get a clearer picture of patients' health and support clinical decision-making. However, most EHR predictive models do not reflect these procedures, as they either focus on a single modality or overlook the inter-modality interactions/redundancy. In this work, we propose MEDFuse, a Multimodal EHR Data Fusion framework that incorporates masked lab-test modeling and large language models (LLMs) to effectively integrate structured and unstructured medical data. MEDFuse leverages multimodal embeddings extracted from two sources: LLMs fine-tuned on free clinical text and masked tabular transformers trained on structured lab test results. We design a disentangled transformer module, optimized by a mutual information loss to 1) decouple modality-specific and modality-shared information and 2) extract useful joint representation from the noise and redundancy present in clinical notes. Through comprehensive validation on the public MIMIC-III dataset and the in-house FEMH dataset, MEDFuse demonstrates great potential in advancing clinical predictions, achieving over 90% F1 score in the 10-disease multi-label classification task.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes