LG · CR · Jun 21, 2022

The Privacy Onion Effect: Memorization is Relative

ETH Zurich
arXiv:2206.10469v2 · 158 citations · h-index: 63
Originality: Incremental advance
AI Analysis

The paper reveals a fundamental flaw in privacy defenses that lack rigorous guarantees, which matters for any machine learning system where data privacy is critical.

The paper studies privacy leakage in machine learning models. It demonstrates experimentally that removing the outlier points most vulnerable to memorization exposes previously safe points to the same attack.

Machine learning models trained on private datasets have been shown to leak their private data. While recent work has found that the average data point is rarely leaked, the outlier samples are frequently subject to memorization and, consequently, privacy leakage. We demonstrate and analyse an Onion Effect of memorization: removing the "layer" of outlier points that are most vulnerable to a privacy attack exposes a new layer of previously-safe points to the same attack. We perform several experiments to study this effect, and understand why it occurs. The existence of this effect has various consequences. For example, it suggests that proposals to defend against memorization without training with rigorous privacy guarantees are unlikely to be effective. Further, it suggests that privacy-enhancing technologies such as machine unlearning could actually harm the privacy of other users.
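The layer-peeling idea in the abstract can be illustrated with a toy sketch. It is not the paper's method: it trains no model and runs no privacy attack. As a stand-in, it assumes a hypothetical vulnerability score equal to each point's distance to its nearest neighbour, so that isolated points count as "outliers". The function names `vulnerability_scores` and `peel_layer` are made up for this illustration. The sketch removes the highest-scoring layer, rescores the remaining points, and counts how many of them became more exposed under this proxy.

```python
import math
import random


def vulnerability_scores(points):
    """Toy proxy (an assumption, not a real attack): distance to the nearest other point."""
    scores = []
    for i, p in enumerate(points):
        nearest = min(math.dist(p, q) for j, q in enumerate(points) if j != i)
        scores.append(nearest)
    return scores


def peel_layer(points, k):
    """Remove the k highest-scoring points and rescore the rest.

    Returns the kept indices plus each kept point's score before and after removal.
    """
    before = vulnerability_scores(points)
    ranked = sorted(range(len(points)), key=lambda i: before[i], reverse=True)
    removed = set(ranked[:k])
    kept = [i for i in range(len(points)) if i not in removed]
    after = vulnerability_scores([points[i] for i in kept])
    return kept, [before[i] for i in kept], after


# Synthetic 2-D data; the seed only makes the run repeatable.
rng = random.Random(0)
points = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(200)]

kept, before, after = peel_layer(points, k=20)
newly_exposed = sum(a > b for a, b in zip(after, before))
print(f"{newly_exposed} of {len(kept)} remaining points became more isolated")
```

Under this proxy a point's score can only stay the same or rise when other points are removed, which mirrors the claim that vulnerability is relative to the rest of the dataset. A real study would presumably retrain the model and rerun the attack after each removal instead of recomputing a geometric score.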

Code Implementations: 1 repo