LGMar 30, 2023

Mole Recruitment: Poisoning of Image Classifiers via Selective Batch Sampling

arXiv:2303.17080v1h-index: 43
Originality Highly original
AI Analysis

This work exposes a previously undetected vulnerability in state-of-the-art image classifiers, posing a security risk for applications relying on machine learning models.

The paper tackles the problem of data poisoning attacks on image classifiers by introducing Mole Recruitment, a method that degrades model performance by selectively sampling natural training samples that are most confusing between classes, without altering images or labels. The attack significantly reduces performance on targeted classes across standard datasets and is shown to be viable in real-world continual learning settings.

In this work, we present a data poisoning attack that confounds machine learning models without any manipulation of the image or label. This is achieved by simply leveraging the most confounding natural samples found within the training data itself, in a new form of a targeted attack coined "Mole Recruitment." We define moles as the training samples of a class that appear most similar to samples of another class, and show that simply restructuring training batches with an optimal number of moles can lead to significant degradation in the performance of the targeted class. We show the efficacy of this novel attack in an offline setting across several standard image classification datasets, and demonstrate the real-world viability of this attack in a continual learning (CL) setting. Our analysis reveals that state-of-the-art models are susceptible to Mole Recruitment, thereby exposing a previously undetected vulnerability of image classifiers.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes