CVCRLGMLJun 14, 2019

Copy and Paste: A Simple But Effective Initialization Method for Black-Box Adversarial Attacks

arXiv:1906.06086v210 citations
Originality Incremental advance
AI Analysis

This work addresses query efficiency in black-box adversarial attacks for image classification, presenting an incremental improvement over existing methods.

The paper tackled the problem of initializing black-box adversarial attacks on image classifiers, showing that copying small patches from other images as starting points reduces the number of queries required for a state-of-the-art Boundary Attack by 81% on ImageNet.

Many optimization methods for generating black-box adversarial examples have been proposed, but the aspect of initializing said optimizers has not been considered in much detail. We show that the choice of starting points is indeed crucial, and that the performance of state-of-the-art attacks depends on it. First, we discuss desirable properties of starting points for attacking image classifiers, and how they can be chosen to increase query efficiency. Notably, we find that simply copying small patches from other images is a valid strategy. We then present an evaluation on ImageNet that clearly demonstrates the effectiveness of this method: Our initialization scheme reduces the number of queries required for a state-of-the-art Boundary Attack by 81%, significantly outperforming previous results reported for targeted black-box adversarial examples.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes