CV AISep 23, 2024

Distribution-Level Feature Distancing for Machine Unlearning: Towards a Better Trade-off Between Model Utility and Forgetting

arXiv:2409.14747v511.39 citationsh-index: 5Has Code

Originality Incremental advance

AI Analysis

This addresses privacy concerns in AI industries by enabling efficient data removal without compromising model performance, though it is incremental as it builds on existing unlearning methods.

The paper tackles the problem of utility drop in machine unlearning for deep neural networks, where forgetting specific data weakens feature-label correlations, and proposes Distribution-Level Feature Distancing (DLFD) to synthesize data that distances feature distributions from forget samples, achieving state-of-the-art results in forgetting and utility preservation on facial recognition datasets.

With the explosive growth of deep learning applications and increasing privacy concerns, the right to be forgotten has become a critical requirement in various AI industries. For example, given a facial recognition system, some individuals may wish to remove their personal data that might have been used in the training phase. Unfortunately, deep neural networks sometimes unexpectedly leak personal identities, making this removal challenging. While recent machine unlearning algorithms aim to enable models to forget specific data, we identify an unintended utility drop-correlation collapse-in which the essential correlations between image features and true labels weaken during the forgetting process. To address this challenge, we propose Distribution-Level Feature Distancing (DLFD), a novel method that efficiently forgets instances while preserving task-relevant feature correlations. Our method synthesizes data samples by optimizing the feature distribution to be distinctly different from that of forget samples, achieving effective results within a single training epoch. Through extensive experiments on facial recognition datasets, we demonstrate that our approach significantly outperforms state-of-the-art machine unlearning methods in both forgetting performance and model utility preservation.

View on arXiv PDF Code

Similar