CRLGMLOct 18, 2016

Membership Inference Attacks against Machine Learning Models

arXiv:1610.05820v25255 citations
Originality Highly original
AI Analysis

This addresses privacy risks for users of machine learning services, particularly in sensitive domains like healthcare, by exposing vulnerabilities in widely used commercial models.

The paper tackles the problem of machine learning models leaking information about their training data through membership inference attacks, where an adversary can determine if a specific record was part of the training set, and shows that commercial models from providers like Google and Amazon are vulnerable to such attacks.

We quantitatively investigate how machine learning models leak information about the individual data records on which they were trained. We focus on the basic membership inference attack: given a data record and black-box access to a model, determine if the record was in the model's training dataset. To perform membership inference against a target model, we make adversarial use of machine learning and train our own inference model to recognize differences in the target model's predictions on the inputs that it trained on versus the inputs that it did not train on. We empirically evaluate our inference techniques on classification models trained by commercial "machine learning as a service" providers such as Google and Amazon. Using realistic datasets and classification tasks, including a hospital discharge dataset whose membership is sensitive from the privacy perspective, we show that these models can be vulnerable to membership inference attacks. We then investigate the factors that influence this leakage and evaluate mitigation strategies.

Code Implementations11 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes