LG · Feb 6, 2025

Realistic Image-to-Image Machine Unlearning via Decoupling and Knowledge Retention

arXiv:2502.04260v27.11 citations · h-index: 6

Originality: Incremental advance

AI Analysis

This work addresses privacy and security concerns for participants whose data was used to train generative models by enabling that data to be removed. The advance is incremental: it builds on existing unlearning methods and applies them to a specific domain.

The paper tackles machine unlearning for image-to-image generative models. It proposes a framework that treats forget samples as out-of-distribution data, decoupling the model parameters with gradient ascent and then fine-tuning on the remaining samples. The authors report that empirical evaluation on ImageNet-1K and Places365 favors their approach.

Machine unlearning allows participants to remove their data from a trained machine learning model in order to preserve their privacy and security. However, the machine unlearning literature for generative models is rather limited. The existing literature on image-to-image generative models (I2I models) treats minimizing the distance between Gaussian noise and the output of the I2I model on forget samples as machine unlearning. However, we argue that a machine learning model performs fairly well on unseen data, i.e., a retrained model will be able to capture generic patterns in the data and hence will not generate an output equivalent to Gaussian noise. In this paper, we consider that the model after unlearning should treat forget samples as out-of-distribution (OOD) data, i.e., the unlearned model should no longer recognize or encode the specific patterns found in the forget samples. To achieve this, we propose a framework that decouples the model parameters with gradient ascent, ensuring with a theoretical guarantee that forget samples are OOD for the unlearned model. We also provide an $(ε, δ)$-unlearning guarantee for model updates with gradient ascent. The unlearned model is further fine-tuned on the remaining samples to maintain its performance. We also propose an attack model to verify that the unlearned model has effectively removed the influence of forget samples. Extensive empirical evaluation on two large-scale datasets, ImageNet-1K and Places365, highlights the superiority of our approach. To show comparable performance with a retrained model, we also show a comparison with various baselines using a simple AutoEncoder on the CIFAR-10 dataset.
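
The two-stage idea in the abstract (gradient ascent on the forget samples, then fine-tuning on the remaining samples) can be illustrated with a toy sketch. This is not the paper's implementation: the two-weight "reconstruction model", the sample values, the learning rate, and every function name below are assumptions chosen only to make the mechanics visible, and the sketch does not cover the paper's OOD criterion, its $(ε, δ)$ guarantee, or its attack model.

```python
# Toy sketch only: a 2-parameter "reconstruction model" standing in for an I2I model.
# The model reconstructs x as (w[0]*x[0], w[1]*x[1]); all names here are hypothetical.

def recon_loss(w, samples):
    """Mean squared reconstruction error of the toy model over 2-D samples."""
    total = 0.0
    for x in samples:
        total += sum((w[i] * x[i] - x[i]) ** 2 for i in range(2))
    return total / len(samples)


def recon_grad(w, samples):
    """Gradient of recon_loss with respect to the two weights."""
    grad = [0.0, 0.0]
    for x in samples:
        for i in range(2):
            grad[i] += 2.0 * (w[i] * x[i] - x[i]) * x[i]
    return [g / len(samples) for g in grad]


def gradient_steps(w, samples, lr, steps, ascent=False):
    """Run plain gradient descent, or gradient ascent when ascent=True."""
    sign = 1.0 if ascent else -1.0
    w = list(w)
    for _ in range(steps):
        g = recon_grad(w, samples)
        w = [w[i] + sign * lr * g[i] for i in range(2)]
    return w


def run_demo():
    retain = [(1.0, 0.0), (2.0, 0.1)]  # assumed data, mostly along axis 0
    forget = [(0.0, 1.0), (0.1, 2.0)]  # assumed data, mostly along axis 1
    w_trained = [0.95, 0.95]           # assumed "trained" weights, near-perfect reconstruction

    # Stage 1: gradient ascent on the forget samples raises their reconstruction loss.
    w_ascent = gradient_steps(w_trained, forget, lr=0.1, steps=5, ascent=True)
    # Stage 2: fine-tuning on the retain samples lowers the loss on those samples again.
    w_final = gradient_steps(w_ascent, retain, lr=0.1, steps=20)

    return {
        "forget_before": recon_loss(w_trained, forget),
        "forget_after": recon_loss(w_final, forget),
        "retain_before": recon_loss(w_trained, retain),
        "retain_mid": recon_loss(w_ascent, retain),
        "retain_after": recon_loss(w_final, retain),
    }


if __name__ == "__main__":
    for name, value in run_demo().items():
        print(f"{name}: {value:.5f}")
```

In this toy setting the forget loss stays well above its starting value after fine-tuning, while the retain loss drops back down. This works because the two sample sets mostly depend on different weights; in a real I2I model the parameters are shared far more heavily, which is presumably why the abstract emphasizes decoupling.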
