LGAIMay 12, 2023

Learn to Unlearn: A Survey on Machine Unlearning

arXiv:2305.07512v24 citations
Originality Synthesis-oriented
AI Analysis

This is an incremental survey paper that provides resources for integrating privacy, equity, and resilience into ML systems, targeting researchers and practitioners concerned with data privacy and model management.

The paper surveys machine unlearning techniques, which address the problem of removing specific data samples from trained ML models to comply with privacy regulations and mitigate issues like data poisoning, by reviewing methods, verification, and attacks.

Machine Learning (ML) models have been shown to potentially leak sensitive information, thus raising privacy concerns in ML-driven applications. This inspired recent research on removing the influence of specific data samples from a trained ML model. Such efficient removal would enable ML to comply with the "right to be forgotten" in many legislation, and could also address performance bottlenecks from low-quality or poisonous samples. In that context, machine unlearning methods have been proposed to erase the contributions of designated data samples on models, as an alternative to the often impracticable approach of retraining models from scratch. This article presents a comprehensive review of recent machine unlearning techniques, verification mechanisms, and potential attacks. We further highlight emerging challenges and prospective research directions (e.g. resilience and fairness concerns). We aim for this paper to provide valuable resources for integrating privacy, equity, andresilience into ML systems and help them "learn to unlearn".

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes