ML CR LGAug 29, 2019

White-box vs Black-box: Bayes Optimal Strategies for Membership Inference

Alexandre Sablayrolles, Matthijs Douze, Yann Ollivier, Cordelia Schmid, Hervé Jégou

arXiv:1908.11229v137.9455 citations

Originality Highly original

AI Analysis

This work addresses privacy risks in machine learning by providing a foundational approach to membership inference, which is crucial for data protection but is incremental in refining attack strategies.

The paper tackled the problem of membership inference attacks on machine learning models by deriving the optimal strategy under certain assumptions, showing that black-box attacks can be as effective as white-box attacks. It introduced approximations of this optimal strategy that outperform state-of-the-art methods across various settings, including ResNet-101 on Imagenet.

Membership inference determines, given a sample and trained parameters of a machine learning model, whether the sample was part of the training set. In this paper, we derive the optimal strategy for membership inference with a few assumptions on the distribution of the parameters. We show that optimal attacks only depend on the loss function, and thus black-box attacks are as good as white-box attacks. As the optimal strategy is not tractable, we provide approximations of it leading to several inference methods, and show that existing membership inference methods are coarser approximations of this optimal strategy. Our membership attacks outperform the state of the art in various settings, ranging from a simple logistic regression to more complex architectures and datasets, such as ResNet-101 and Imagenet.

View on arXiv PDF

Similar