LG CRNov 20, 2023

Understanding Variation in Subpopulation Susceptibility to Poisoning Attacks

arXiv:2311.11544v13.81 citationsh-index: 8

Originality Incremental advance

AI Analysis

This work addresses security vulnerabilities in ML for practitioners by identifying factors that make specific subpopulations more susceptible to attacks, though it is incremental as it builds on prior observations of disparate attack effectiveness.

The paper investigates why certain subpopulations are more vulnerable to poisoning attacks in machine learning, finding that dataset separability and loss differences between clean and target models are key factors, with empirical results showing these properties generalize to high-dimensional datasets like Adult.

Machine learning is susceptible to poisoning attacks, in which an attacker controls a small fraction of the training data and chooses that data with the goal of inducing some behavior unintended by the model developer in the trained model. We consider a realistic setting in which the adversary with the ability to insert a limited number of data points attempts to control the model's behavior on a specific subpopulation. Inspired by previous observations on disparate effectiveness of random label-flipping attacks on different subpopulations, we investigate the properties that can impact the effectiveness of state-of-the-art poisoning attacks against different subpopulations. For a family of 2-dimensional synthetic datasets, we empirically find that dataset separability plays a dominant role in subpopulation vulnerability for less separable datasets. However, well-separated datasets exhibit more dependence on individual subpopulation properties. We further discover that a crucial subpopulation property is captured by the difference in loss on the clean dataset between the clean model and a target model that misclassifies the subpopulation, and a subpopulation is much easier to attack if the loss difference is small. This property also generalizes to high-dimensional benchmark datasets. For the Adult benchmark dataset, we show that we can find semantically-meaningful subpopulation properties that are related to the susceptibilities of a selected group of subpopulations. The results in this paper are accompanied by a fully interactive web-based visualization of subpopulation poisoning attacks found at https://uvasrg.github.io/visualizing-poisoning

View on arXiv PDF

Similar