SPLGApr 28, 2025

Towards Robust Multimodal Physiological Foundation Models: Handling Arbitrary Missing Modalities

arXiv:2504.19596v28 citationsh-index: 7Has Code
Originality Highly original
AI Analysis

This addresses the challenge of robust multimodal analysis in healthcare and brain-computer interfaces, offering a novel foundation model approach.

The paper tackles the problem of learning universal representations from multimodal physiological signals that generalize across datasets and handle missing modalities at inference, proposing PhysioOmni, which achieves state-of-the-art performance on four downstream tasks while maintaining strong robustness to missing modalities.

Multimodal physiological signals, such as EEG, ECG, EOG, and EMG, are crucial for healthcare and brain-computer interfaces. While existing methods rely on specialized architectures and dataset-specific fusion strategies, they struggle to learn universal representations that generalize across datasets and handle missing modalities at inference time. To address these issues, we propose PhysioOmni, a foundation model for multimodal physiological signal analysis that models both homogeneous and heterogeneous features to decouple multimodal signals and extract generic representations while maintaining compatibility with arbitrary missing modalities. PhysioOmni trains a decoupled multimodal tokenizer, enabling masked signal pre-training via modality-invariant and modality-specific objectives. To ensure adaptability to diverse and incomplete modality combinations, the pre-trained encoders undergo resilient fine-tuning with prototype alignment on downstream datasets. Extensive experiments on four downstream tasks, emotion recognition, sleep stage classification, motor prediction, and mental workload detection, demonstrate that PhysioOmni achieves state-of-the-art performance while maintaining strong robustness to missing modalities. Our code and model weights will be released.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes