BrainDINO: A Brain MRI Foundation Model for Generalizable Clinical Representation Learning

Yizhou Wu, Shansong Wang, Yuheng Li, Mojtaba Safari, Mingzhe Hu, Chih-Wei Chang, Harini Veeraraghavan, Xiaofeng Yang

arXiv:2604.2727764.6

Predicted impact top 32% in LG · last 90 daysOriginality Incremental advance

AI Analysis

Provides a single, generalizable brain MRI representation that reduces the need for task-specific models and labeled data, benefiting the neuroimaging community.

BrainDINO, a self-supervised foundation model trained on 6.6 million unlabeled brain MRI slices, achieves equal or superior performance across diverse tasks (tumor segmentation, disease classification, age estimation, etc.) compared to baselines, with strong advantages under label scarcity.

Brain MRI underpins a wide range of neuroscientific and clinical applications, yet most learning-based methods remain task-specific and require substantial labeled data. Here we show that a single self-supervised representation can generalize across heterogeneous brain MRI endpoints. We trained BrainDINO, a self-distilled foundation model, on approximately 6.6 million unlabeled axial slices from 20 datasets encompassing broad variation in population, disease, and acquisition setting. Using a frozen encoder with lightweight task heads, BrainDINO supported transfer across tumor segmentation, neurodegenerative and neurodevelopmental conditions classification, brain age estimation, post-stroke temporal prediction, molecular status prediction, MRI sequence classification, and survival modeling. Across tasks and supervision regimes, BrainDINO consistently equaled or exceeded natural-image and MRI-specific self-supervised baselines, with particularly strong advantages under label scarcity. Representation analyses further showed anatomically organized and pathology-sensitive feature structure in the absence of task-specific supervision. Our findings indicate that large-scale slice-wise self-supervised learning can yield a unified brain MRI representation that supports diverse neuroimaging tasks without volumetric pretraining or full-network fine-tuning, establishing a scalable foundation for robust and data-efficient brain imaging analysis.

View on arXiv PDF

Similar