LGNov 20, 2025

Descend or Rewind? Stochastic Gradient Descent Unlearning

arXiv:2511.15983v14.1h-index: 4

Originality Incremental advance

AI Analysis

This work addresses the need for provable unlearning guarantees in machine learning, which is crucial for privacy and compliance, though it is incremental as it builds on existing algorithms by adding theoretical backing.

The authors tackled the problem of providing theoretical guarantees for stochastic gradient descent-based machine unlearning algorithms, specifically Descent-to-Delete (D2D) and Rewind-to-Delete (R2D), by proving (ε, δ) certified unlearning guarantees for strongly convex, convex, and nonconvex loss functions, with D2D offering tighter guarantees for strongly convex functions and R2D enabling unlearning in convex and nonconvex settings.

Machine unlearning algorithms aim to remove the impact of selected training data from a model without the computational expenses of retraining from scratch. Two such algorithms are ``Descent-to-Delete" (D2D) and ``Rewind-to-Delete" (R2D), full-batch gradient descent algorithms that are easy to implement and satisfy provable unlearning guarantees. In particular, the stochastic version of D2D is widely implemented as the ``finetuning" unlearning baseline, despite lacking theoretical backing on nonconvex functions. In this work, we prove $(ε, δ)$ certified unlearning guarantees for stochastic R2D and D2D for strongly convex, convex, and nonconvex loss functions, by analyzing unlearning through the lens of disturbed or biased gradient systems, which may be contracting, semi-contracting, or expansive respectively. Our argument relies on optimally coupling the random behavior of the unlearning and retraining trajectories, resulting in a probabilistic sensitivity bound that can be combined with a novel relaxed Gaussian mechanism to achieve $(ε, δ)$ unlearning. We determine that D2D can yield tighter guarantees for strongly convex functions compared to R2D by relying on contraction to a unique global minimum. However, unlike D2D, R2D can achieve unlearning in the convex and nonconvex setting because it draws the unlearned model closer to the retrained model by reversing the accumulated disturbances.

View on arXiv PDF

Similar