LG CR MLJun 12, 2023

Gaussian Membership Inference Privacy

Tobias Leemann, Martin Pawelczyk, Gjergji Kasneci

arXiv:2306.07273v218.025 citationsh-index: 37Has Code

Originality Highly original

AI Analysis

This work tackles privacy vulnerabilities in machine learning models for data protection, offering interpretable guarantees and improved utility, though it is incremental in refining existing privacy frameworks.

The paper introduces a new privacy notion called $f$-Membership Inference Privacy ($f$-MIP) to address membership inference attacks, showing that standard SGD training provides basic MIP and that adding noise can amplify it, while also developing an analytical attack that outperforms prior methods by eliminating the need for shadow models.

We propose a novel and practical privacy notion called $f$-Membership Inference Privacy ($f$-MIP), which explicitly considers the capabilities of realistic adversaries under the membership inference attack threat model. Consequently, $f$-MIP offers interpretable privacy guarantees and improved utility (e.g., better classification accuracy). In particular, we derive a parametric family of $f$-MIP guarantees that we refer to as $μ$-Gaussian Membership Inference Privacy ($μ$-GMIP) by theoretically analyzing likelihood ratio-based membership inference attacks on stochastic gradient descent (SGD). Our analysis highlights that models trained with standard SGD already offer an elementary level of MIP. Additionally, we show how $f$-MIP can be amplified by adding noise to gradient updates. Our analysis further yields an analytical membership inference attack that offers two distinct advantages over previous approaches. First, unlike existing state-of-the-art attacks that require training hundreds of shadow models, our attack does not require any shadow model. Second, our analytical attack enables straightforward auditing of our privacy notion $f$-MIP. Finally, we quantify how various hyperparameters (e.g., batch size, number of model parameters) and specific data characteristics determine an attacker's ability to accurately infer a point's membership in the training set. We demonstrate the effectiveness of our method on models trained on vision and tabular datasets.

View on arXiv PDF Code

Similar