CLMay 11, 2023

KGA: A General Machine Unlearning Framework Based on Knowledge Gap Alignment

arXiv:2305.06535v1266 citations
Originality Incremental advance
AI Analysis

It addresses the need for unlearning in NLP to handle sensitive personal information in text, but is incremental as it adapts existing concepts to new domains.

The paper tackles machine unlearning for NLP tasks by proposing the KGA framework, which maintains distribution differences to induce forgetfulness, and shows comprehensive improvements over baselines on large-scale datasets.

Recent legislation of the "right to be forgotten" has led to the interest in machine unlearning, where the learned models are endowed with the function to forget information about specific training instances as if they have never existed in the training set. Previous work mainly focuses on computer vision scenarios and largely ignores the essentials of unlearning in NLP field, where text data contains more explicit and sensitive personal information than images. In this paper, we propose a general unlearning framework called KGA to induce forgetfulness. Different from previous work that tries to recover gradients or forces models to perform close to one specific distribution, KGA maintains distribution differences (i.e., knowledge gap). This relaxes the distribution assumption. Furthermore, we first apply the unlearning method to various NLP tasks (i.e., classification, translation, response generation) and propose several unlearning evaluation metrics with pertinence. Experiments on large-scale datasets show that KGA yields comprehensive improvements over baselines, where extensive analyses further validate the effectiveness of KGA and provide insight into unlearning for NLP tasks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes