CRLGMLMar 14, 2020

Certified Defenses for Adversarial Patches

arXiv:2003.06693v2194 citationsHas Code
Originality Highly original
AI Analysis

This work addresses a critical security problem for real-world computer vision systems, offering a certified defense against practical adversarial threats.

The paper tackles the vulnerability of computer vision systems to adversarial patch attacks by proposing the first certified defense against such attacks, achieving robustness with faster training methods and demonstrating good transfer across different patch shapes.

Adversarial patch attacks are among one of the most practical threat models against real-world computer vision systems. This paper studies certified and empirical defenses against patch attacks. We begin with a set of experiments showing that most existing defenses, which work by pre-processing input images to mitigate adversarial patches, are easily broken by simple white-box adversaries. Motivated by this finding, we propose the first certified defense against patch attacks, and propose faster methods for its training. Furthermore, we experiment with different patch shapes for testing, obtaining surprisingly good robustness transfer across shapes, and present preliminary results on certified defense against sparse attacks. Our complete implementation can be found on: https://github.com/Ping-C/certifiedpatchdefense.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes