CRNov 11, 2020

Detecting Adversarial Patches with Class Conditional Reconstruction Networks

arXiv:2011.05850v22 citations
AI Analysis

This work addresses defense against physical adversarial attacks in deep learning, which is an incremental improvement over existing detection methods.

The paper tackled the problem of detecting physical adversarial patches using autoencoder-based detectors, finding that the detector retains some effectiveness against adaptive attacks but performance decreases with dataset complexity.

Defending against physical adversarial attacks is a rapidly growing topic in deep learning and computer vision. Prominent forms of physical adversarial attacks, such as overlaid adversarial patches and objects, share similarities with digital attacks, but are easy for humans to notice. This leads us to explore the hypothesis that adversarial detection methods, which have been shown to be ineffective against adaptive digital adversarial examples, can be effective against these physical attacks. We use one such detection method based on autoencoder architectures, and perform adversarial patching experiments on MNIST, SVHN, and CIFAR10 against a CNN architecture and two CapsNet architectures. We also propose two modifications to the EM-Routed CapsNet architecture, Affine Voting and Matrix Capsule Dropout, to improve its classification performance. Our investigation shows that the detector retains some of its effectiveness even against adaptive adversarial patch attacks. In addition, detection performance tends to decrease among all the architectures with the increase of dataset complexity.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes