IVAICVOct 2, 2025

SpurBreast: A Curated Dataset for Investigating Spurious Correlations in Real-world Breast MRI Classification

arXiv:2510.02109v1h-index: 27Has CodeMICCAI
Originality Incremental advance
AI Analysis

This addresses the challenge of deploying DNNs in real-world medical imaging by providing a dataset to study spurious correlations, though it is incremental as it builds on existing work on dataset curation for robustness.

The authors tackled the problem of spurious correlations in breast MRI classification by introducing SpurBreast, a curated dataset that intentionally includes spurious signals like magnetic field strength and image orientation, and showed that DNNs exploit these to achieve high validation accuracy but fail to generalize to unbiased test data.

Deep neural networks (DNNs) have demonstrated remarkable success in medical imaging, yet their real-world deployment remains challenging due to spurious correlations, where models can learn non-clinical features instead of meaningful medical patterns. Existing medical imaging datasets are not designed to systematically study this issue, largely due to restrictive licensing and limited supplementary patient data. To address this gap, we introduce SpurBreast, a curated breast MRI dataset that intentionally incorporates spurious correlations to evaluate their impact on model performance. Analyzing over 100 features involving patient, device, and imaging protocol, we identify two dominant spurious signals: magnetic field strength (a global feature influencing the entire image) and image orientation (a local feature affecting spatial alignment). Through controlled dataset splits, we demonstrate that DNNs can exploit these non-clinical signals, achieving high validation accuracy while failing to generalize to unbiased test data. Alongside these two datasets containing spurious correlations, we also provide benchmark datasets without spurious correlations, allowing researchers to systematically investigate clinically relevant and irrelevant features, uncertainty estimation, adversarial robustness, and generalization strategies. Models and datasets are available at https://github.com/utkuozbulak/spurbreast.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes