LG AIMay 12, 2023

Learn to Unlearn: A Survey on Machine Unlearning

Youyang Qu, Xin Yuan, Ming Ding, Wei Ni, Thierry Rakotoarivelo, David Smith

arXiv:2305.07512v213.04 citations

Originality Synthesis-oriented

AI Analysis

This is an incremental survey paper that provides resources for integrating privacy, equity, and resilience into ML systems, targeting researchers and practitioners concerned with data privacy and model management.

The paper surveys machine unlearning techniques, which address the problem of removing specific data samples from trained ML models to comply with privacy regulations and mitigate issues like data poisoning, by reviewing methods, verification, and attacks.

Machine Learning (ML) models have been shown to potentially leak sensitive information, thus raising privacy concerns in ML-driven applications. This inspired recent research on removing the influence of specific data samples from a trained ML model. Such efficient removal would enable ML to comply with the "right to be forgotten" in many legislation, and could also address performance bottlenecks from low-quality or poisonous samples. In that context, machine unlearning methods have been proposed to erase the contributions of designated data samples on models, as an alternative to the often impracticable approach of retraining models from scratch. This article presents a comprehensive review of recent machine unlearning techniques, verification mechanisms, and potential attacks. We further highlight emerging challenges and prospective research directions (e.g. resilience and fairness concerns). We aim for this paper to provide valuable resources for integrating privacy, equity, andresilience into ML systems and help them "learn to unlearn".

View on arXiv PDF

Similar