LGCRMLJan 26, 2020

Ensemble Noise Simulation to Handle Uncertainty about Gradient-based Adversarial Attacks

arXiv:2001.09486v11 citations
AI Analysis

This addresses a gap in adversarial defense for scenarios with uncertain attacker behavior, though it is incremental as it builds on existing pre-processing defenses.

The paper tackles the problem of defending neural networks against gradient-based adversarial attacks when there is uncertainty about the attacker's behavior, by simulating noisy perturbations using an ensemble of attack algorithms and training a Denoising Autoencoder defense, resulting in significant improvements in post-attack accuracy.

Gradient-based adversarial attacks on neural networks can be crafted in a variety of ways by varying either how the attack algorithm relies on the gradient, the network architecture used for crafting the attack, or both. Most recent work has focused on defending classifiers in a case where there is no uncertainty about the attacker's behavior (i.e., the attacker is expected to generate a specific attack using a specific network architecture). However, if the attacker is not guaranteed to behave in a certain way, the literature lacks methods in devising a strategic defense. We fill this gap by simulating the attacker's noisy perturbation using a variety of attack algorithms based on gradients of various classifiers. We perform our analysis using a pre-processing Denoising Autoencoder (DAE) defense that is trained with the simulated noise. We demonstrate significant improvements in post-attack accuracy, using our proposed ensemble-trained defense, compared to a situation where no effort is made to handle uncertainty.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes