LGCVFeb 25, 2021

Fast Minimum-norm Adversarial Attacks through Adaptive Norm Constraints

arXiv:2102.12827v384 citations
AI Analysis

This provides a more efficient tool for researchers and practitioners assessing adversarial robustness in machine learning models, though it is incremental as it builds on existing attack methods.

The paper tackles the problem of efficiently finding minimum adversarial perturbations for evaluating model robustness, proposing a fast minimum-norm attack that works across different norms, requires no tuning or initialization, and converges quickly with comparable or smaller perturbation sizes.

Evaluating adversarial robustness amounts to finding the minimum perturbation needed to have an input sample misclassified. The inherent complexity of the underlying optimization requires current gradient-based attacks to be carefully tuned, initialized, and possibly executed for many computationally-demanding iterations, even if specialized to a given perturbation model. In this work, we overcome these limitations by proposing a fast minimum-norm (FMN) attack that works with different $\ell_p$-norm perturbation models ($p=0, 1, 2, \infty$), is robust to hyperparameter choices, does not require adversarial starting points, and converges within few lightweight steps. It works by iteratively finding the sample misclassified with maximum confidence within an $\ell_p$-norm constraint of size $ε$, while adapting $ε$ to minimize the distance of the current sample to the decision boundary. Extensive experiments show that FMN significantly outperforms existing attacks in terms of convergence speed and computation time, while reporting comparable or even smaller perturbation sizes.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes