LGCRCVNEMLMar 27, 2019

Scaling up the randomized gradient-free adversarial attack reveals overestimation of robustness using established attacks

arXiv:1903.11359v235 citations
Originality Incremental advance
AI Analysis

This work addresses the need for tighter upper bounds on robustness in adversarial machine learning, revealing overestimation by established attacks, though it is incremental as it builds on prior gradient-free methods.

The authors tackled the problem of overestimating neural network robustness by existing adversarial attacks, and significantly improved a randomized gradient-free attack to achieve similar or smaller robust accuracy than state-of-the-art methods like PGD and Carlini-Wagner.

Modern neural networks are highly non-robust against adversarial manipulation. A significant amount of work has been invested in techniques to compute lower bounds on robustness through formal guarantees and to build provably robust models. However, it is still difficult to get guarantees for larger networks or robustness against larger perturbations. Thus attack strategies are needed to provide tight upper bounds on the actual robustness. We significantly improve the randomized gradient-free attack for ReLU networks [9], in particular by scaling it up to large networks. We show that our attack achieves similar or significantly smaller robust accuracy than state-of-the-art attacks like PGD or the one of Carlini and Wagner, thus revealing an overestimation of the robustness by these state-of-the-art methods. Our attack is not based on a gradient descent scheme and in this sense gradient-free, which makes it less sensitive to the choice of hyperparameters as no careful selection of the stepsize is required.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes