CVJun 21, 2019

Evolution Attack On Neural Networks

arXiv:1906.09072v1
Originality Synthesis-oriented
AI Analysis

This work addresses the vulnerability of neural networks to adversarial attacks for security applications, but it is incremental as it applies existing evolution methods to a known problem.

The paper tackles the problem of generating adversarial examples for neural networks in a black-box setting by framing it as an optimization problem and testing various evolution algorithms, finding that the covariance matrix adaptive evolution strategy performs best.

Many studies have been done to prove the vulnerability of neural networks to adversarial example. A trained and well-behaved model can be fooled by a visually imperceptible perturbation, i.e., an originally correctly classified image could be misclassified after a slight perturbation. In this paper, we propose a black-box strategy to attack such networks using an evolution algorithm. First, we formalize the generation of an adversarial example into the optimization problem of perturbations that represent the noise added to an original image at each pixel. To solve this optimization problem in a black-box way, we find that an evolution algorithm perfectly meets our requirement since it can work without any gradient information. Therefore, we test various evolution algorithms, including a simple genetic algorithm, a parameter-exploring policy gradient, an OpenAI evolution strategy, and a covariance matrix adaptive evolution strategy. Experimental results show that a covariance matrix adaptive evolution Strategy performs best in this optimization problem. Additionally, we also perform several experiments to explore the effect of different regularizations on improving the quality of an adversarial example.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes