LG CROct 5, 2022

A Closer Look at Robustness to L-infinity and Spatial Perturbations and their Composition

Luke Rowe, Benjamin Thérien, Krzysztof Czarnecki, Hongyang Zhang

arXiv:2210.02577v13.31 citationsh-index: 61

Originality Incremental advance

AI Analysis

This work addresses a critical gap in adversarial machine learning by focusing on underexplored compositional attacks, offering insights and a method that could enhance model security for practitioners.

The paper tackles the problem of adversarial robustness to composite attacks combining spatial and L-infinity perturbations, proving that linear classifiers cannot achieve non-trivial accuracy in this setting and proposing a new defense strategy, TRADES_All, which demonstrates strong performance and stability across a wide range of transformations.

In adversarial machine learning, the popular $\ell_\infty$ threat model has been the focus of much previous work. While this mathematical definition of imperceptibility successfully captures an infinite set of additive image transformations that a model should be robust to, this is only a subset of all transformations which leave the semantic label of an image unchanged. Indeed, previous work also considered robustness to spatial attacks as well as other semantic transformations; however, designing defense methods against the composition of spatial and $\ell_{\infty}$ perturbations remains relatively underexplored. In the following, we improve the understanding of this seldom investigated compositional setting. We prove theoretically that no linear classifier can achieve more than trivial accuracy against a composite adversary in a simple statistical setting, illustrating its difficulty. We then investigate how state-of-the-art $\ell_{\infty}$ defenses can be adapted to this novel threat model and study their performance against compositional attacks. We find that our newly proposed TRADES$_{\text{All}}$ strategy performs the strongest of all. Analyzing its logit's Lipschitz constant for RT transformations of different sizes, we find that TRADES$_{\text{All}}$ remains stable over a wide range of RT transformations with and without $\ell_\infty$ perturbations.

View on arXiv PDF

Similar