CVAIMMJan 15

Handling Missing Modalities in Multimodal Survival Prediction for Non-Small Cell Lung Cancer

arXiv:2601.10386v11 citationsh-index: 19
Originality Incremental advance
AI Analysis

This addresses the clinical applicability of multimodal deep learning for NSCLC prognosis by handling missing data without dropping patients, though it is incremental in improving fusion strategies.

The paper tackled the problem of missing modalities in multimodal survival prediction for non-small cell lung cancer by developing a missing-aware framework that integrates CT, WSI, and clinical data, achieving a C-index of 73.30 with WSI and clinical fusion.

Accurate survival prediction in Non-Small Cell Lung Cancer (NSCLC) requires the integration of heterogeneous clinical, radiological, and histopathological information. While Multimodal Deep Learning (MDL) offers a promises for precision prognosis and survival prediction, its clinical applicability is severely limited by small cohort sizes and the presence of missing modalities, often forcing complete-case filtering or aggressive imputation. In this work, we present a missing-aware multimodal survival framework that integrates Computed Tomography (CT), Whole-Slide Histopathology (WSI) Images, and structured clinical variables for overall survival modeling in unresectable stage II-III NSCLC. By leveraging Foundation Models (FM) for modality-specific feature extraction and a missing-aware encoding strategy, the proposed approach enables intermediate multimodal fusion under naturally incomplete modality profiles. The proposed architecture is resilient to missing modalities by design, allowing the model to utilize all available data without being forced to drop patients during training or inference. Experimental results demonstrate that intermediate fusion consistently outperforms unimodal baselines as well as early and late fusion strategies, with the strongest performance achieved by the fusion of WSI and clinical modalities (73.30 C-index). Further analyses of modality importance reveal an adaptive behavior in which less informative modalities, i.e., CT modality, are automatically down-weighted and contribute less to the final survival prediction.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes