LGAICRMay 13, 2022

l-Leaks: Membership Inference Attacks with Logits

arXiv:2205.06469v16 citationsh-index: 29
Originality Incremental advance
AI Analysis

This addresses security vulnerabilities in ML models for practitioners, but it is incremental as it builds on existing shadow model approaches.

The paper tackles the problem of membership inference attacks (MIAs) on machine learning models by proposing l-Leaks, a black-box attack that uses logits from the target model to train a shadow model, achieving strong performance across various networks and datasets.

Machine Learning (ML) has made unprecedented progress in the past several decades. However, due to the memorability of the training data, ML is susceptible to various attacks, especially Membership Inference Attacks (MIAs), the objective of which is to infer the model's training data. So far, most of the membership inference attacks against ML classifiers leverage the shadow model with the same structure as the target model. However, empirical results show that these attacks can be easily mitigated if the shadow model is not clear about the network structure of the target model. In this paper, We present attacks based on black-box access to the target model. We name our attack \textbf{l-Leaks}. The l-Leaks follows the intuition that if an established shadow model is similar enough to the target model, then the adversary can leverage the shadow model's information to predict a target sample's membership.The logits of the trained target model contain valuable sample knowledge. We build the shadow model by learning the logits of the target model and making the shadow model more similar to the target model. Then shadow model will have sufficient confidence in the member samples of the target model. We also discuss the effect of the shadow model's different network structures to attack results. Experiments over different networks and datasets demonstrate that both of our attacks achieve strong performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes