IVAICVJul 11, 2024

FedMedICL: Towards Holistic Evaluation of Distribution Shifts in Federated Medical Imaging

arXiv:2407.08822v16 citationsh-index: 21
Originality Incremental advance
AI Analysis

This addresses the need for holistic evaluation in federated learning for medical imaging, which is crucial for clinical impact but often incremental in methodology.

The paper tackled the problem of evaluating distribution shifts in federated medical imaging by introducing FedMedICL, a unified benchmark that simultaneously captures label, demographic, and temporal shifts across six datasets, and found that a simple batch balancing technique outperformed advanced methods in average performance.

For medical imaging AI models to be clinically impactful, they must generalize. However, this goal is hindered by (i) diverse types of distribution shifts, such as temporal, demographic, and label shifts, and (ii) limited diversity in datasets that are siloed within single medical institutions. While these limitations have spurred interest in federated learning, current evaluation benchmarks fail to evaluate different shifts simultaneously. However, in real healthcare settings, multiple types of shifts co-exist, yet their impact on medical imaging performance remains unstudied. In response, we introduce FedMedICL, a unified framework and benchmark to holistically evaluate federated medical imaging challenges, simultaneously capturing label, demographic, and temporal distribution shifts. We comprehensively evaluate several popular methods on six diverse medical imaging datasets (totaling 550 GPU hours). Furthermore, we use FedMedICL to simulate COVID-19 propagation across hospitals and evaluate whether methods can adapt to pandemic changes in disease prevalence. We find that a simple batch balancing technique surpasses advanced methods in average performance across FedMedICL experiments. This finding questions the applicability of results from previous, narrow benchmarks in real-world medical settings.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes