LGCRMLMay 1, 2020

Bullseye Polytope: A Scalable Clean-Label Poisoning Attack with Improved Transferability

arXiv:2005.00191v3127 citations
AI Analysis

This addresses security vulnerabilities in neural networks for applications like transfer learning, representing an incremental improvement over existing poisoning attacks.

The paper tackles the problem of clean-label dataset poisoning attacks on neural networks by proposing Bullseye Polytope, a scalable and transferable attack that improves the state-of-the-art attack success rate by 26.75% in end-to-end transfer learning and increases attack speed by 12 times.

A recent source of concern for the security of neural networks is the emergence of clean-label dataset poisoning attacks, wherein correctly labeled poison samples are injected into the training dataset. While these poison samples look legitimate to the human observer, they contain malicious characteristics that trigger a targeted misclassification during inference. We propose a scalable and transferable clean-label poisoning attack against transfer learning, which creates poison images with their center close to the target image in the feature space. Our attack, Bullseye Polytope, improves the attack success rate of the current state-of-the-art by 26.75% in end-to-end transfer learning, while increasing attack speed by a factor of 12. We further extend Bullseye Polytope to a more practical attack model by including multiple images of the same object (e.g., from different angles) when crafting the poison samples. We demonstrate that this extension improves attack transferability by over 16% to unseen images (of the same object) without using extra poison samples.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes