IVLGMay 28, 2025

Targeted Unlearning Using Perturbed Sign Gradient Methods With Applications On Medical Images

arXiv:2505.21872v2h-index: 24Trans. Mach. Learn. Res.
Originality Incremental advance
AI Analysis

This work addresses model maintenance challenges in clinical applications, such as data shifts and policy changes, by offering a modular alternative to retraining, though it is incremental as it builds on existing unlearning concepts.

The paper tackles the problem of machine unlearning for post-deployment model revision in clinical contexts, proposing a bilevel optimization method that outperforms baselines on forgetting and retention metrics across benchmark and real-world datasets.

Machine unlearning aims to remove the influence of specific training samples from a trained model without full retraining. While prior work has largely focused on privacy-motivated settings, we recast unlearning as a general-purpose tool for post-deployment model revision. Specifically, we focus on utilizing unlearning in clinical contexts where data shifts, device deprecation, and policy changes are common. To this end, we propose a bilevel optimization formulation of boundary-based unlearning that can be solved using iterative algorithms. We provide convergence guarantees when first-order algorithms are used to unlearn. Our method introduces tunable loss design for controlling the forgetting-retention tradeoff and supports novel model composition strategies that merge the strengths of distinct unlearning runs. Across benchmark and real-world clinical imaging datasets, our approach outperforms baselines on both forgetting and retention metrics, including scenarios involving imaging devices and anatomical outliers. This work establishes machine unlearning as a modular, practical alternative to retraining for real-world model maintenance in clinical applications.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes