CVJun 26, 2024

Towards Synchronous Memorizability and Generalizability with Site-Modulated Diffusion Replay for Cross-Site Continual Segmentation

Dunyuan Xu, Xi Wang, Jingyang Zhang, Pheng-Ann Heng

arXiv:2406.18037v32.01 citationsHas Code

Originality Highly original

AI Analysis

This addresses a critical challenge for medical AI systems that need to learn sequentially from multiple data sites while maintaining privacy, though it is incremental as it builds on existing CL and DG approaches.

The paper tackles the problem of catastrophic forgetting and poor generalization in cross-site continual medical image segmentation by proposing a novel training paradigm called SMG-Learning, which uses Parallel Gradient Alignment and a Site-Modulated Diffusion model to replay images, achieving better memorizability and generalizability than state-of-the-art methods.

The ability to learn sequentially from different data sites is crucial for a deep network in solving practical medical image diagnosis problems due to privacy restrictions and storage limitations. However, adapting on incoming site leads to catastrophic forgetting on past sites and decreases generalizablity on unseen sites. Existing Continual Learning (CL) and Domain Generalization (DG) methods have been proposed to solve these two challenges respectively, but none of them can address both simultaneously. Recognizing this limitation, this paper proposes a novel training paradigm, learning towards Synchronous Memorizability and Generalizability (SMG-Learning). To achieve this, we create the orientational gradient alignment to ensure memorizability on previous sites, and arbitrary gradient alignment to enhance generalizability on unseen sites. This approach is named as Parallel Gradient Alignment (PGA). Furthermore, we approximate the PGA as dual meta-objectives using the first-order Taylor expansion to reduce computational cost of aligning gradients. Considering that performing gradient alignments, especially for previous sites, is not feasible due to the privacy constraints, we design a Site-Modulated Diffusion (SMD) model to generate images with site-specific learnable prompts, replaying images have similar data distributions as previous sites. We evaluate our method on two medical image segmentation tasks, where data from different sites arrive sequentially. Experimental results show that our method efficiently enhances both memorizability and generalizablity better than other state-of-the-art methods, delivering satisfactory performance across all sites. Our code will be available at: https://github.com/dyxu-cuhkcse/SMG-Learning.

View on arXiv PDF Code

Similar