IV CVJun 28, 2024

Comprehensive Generative Replay for Task-Incremental Segmentation with Concurrent Appearance and Semantic Forgetting

Wei Li, Jingyang Zhang, Pheng-Ann Heng, Lixu Gu

arXiv:2406.19796v111.95 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses a privacy-preserving training challenge for generalist segmentation models in medical imaging, though it appears incremental as it builds on existing generative replay methods.

The paper tackles the problem of concurrent appearance and semantic forgetting in task-incremental segmentation by proposing a Comprehensive Generative Replay framework, which uses a Bayesian Joint Diffusion model and Task-Oriented Adapter to synthesize image-mask pairs, showing clear advantages in experiments on cardiac, fundus, and prostate segmentation tasks.

Generalist segmentation models are increasingly favored for diverse tasks involving various objects from different image sources. Task-Incremental Learning (TIL) offers a privacy-preserving training paradigm using tasks arriving sequentially, instead of gathering them due to strict data sharing policies. However, the task evolution can span a wide scope that involves shifts in both image appearance and segmentation semantics with intricate correlation, causing concurrent appearance and semantic forgetting. To solve this issue, we propose a Comprehensive Generative Replay (CGR) framework that restores appearance and semantic knowledge by synthesizing image-mask pairs to mimic past task data, which focuses on two aspects: modeling image-mask correspondence and promoting scalability for diverse tasks. Specifically, we introduce a novel Bayesian Joint Diffusion (BJD) model for high-quality synthesis of image-mask pairs with their correspondence explicitly preserved by conditional denoising. Furthermore, we develop a Task-Oriented Adapter (TOA) that recalibrates prompt embeddings to modulate the diffusion model, making the data synthesis compatible with different tasks. Experiments on incremental tasks (cardiac, fundus and prostate segmentation) show its clear advantage for alleviating concurrent appearance and semantic forgetting. Code is available at https://github.com/jingyzhang/CGR.

View on arXiv PDF Code

Similar