CVCRDec 2, 2021

Is RobustBench/AutoAttack a suitable Benchmark for Adversarial Robustness?

arXiv:2112.01601v412 citations
Originality Synthesis-oriented
AI Analysis

This challenges the suitability of a widely used benchmark for adversarial robustness in machine learning, potentially impacting researchers and practitioners relying on it for model evaluation.

The paper critiques RobustBench/AutoAttack as a benchmark for adversarial robustness, arguing that its l-inf perturbations on CIFAR10 are unrealistically strong and easily detectable, and that results do not generalize well to higher-resolution images.

Recently, RobustBench (Croce et al. 2020) has become a widely recognized benchmark for the adversarial robustness of image classification networks. In its most commonly reported sub-task, RobustBench evaluates and ranks the adversarial robustness of trained neural networks on CIFAR10 under AutoAttack (Croce and Hein 2020b) with l-inf perturbations limited to eps = 8/255. With leading scores of the currently best performing models of around 60% of the baseline, it is fair to characterize this benchmark to be quite challenging. Despite its general acceptance in recent literature, we aim to foster discussion about the suitability of RobustBench as a key indicator for robustness which could be generalized to practical applications. Our line of argumentation against this is two-fold and supported by excessive experiments presented in this paper: We argue that I) the alternation of data by AutoAttack with l-inf, eps = 8/255 is unrealistically strong, resulting in close to perfect detection rates of adversarial samples even by simple detection algorithms and human observers. We also show that other attack methods are much harder to detect while achieving similar success rates. II) That results on low-resolution data sets like CIFAR10 do not generalize well to higher resolution images as gradient-based attacks appear to become even more detectable with increasing resolutions.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes