CVFeb 9, 2022

Towards Compositional Adversarial Robustness: Generalizing Adversarial Training to Composite Semantic Perturbations

arXiv:2202.04235v348 citations
Originality Incremental advance
AI Analysis

This addresses the need for more realistic adversarial robustness in machine learning models by moving beyond single perturbation types to composite scenarios, representing an incremental advance in the field.

The paper tackles the problem of adversarial robustness against composite semantic perturbations, proposing a method that generalizes adversarial training to handle combinations of attacks like hue, saturation, brightness, contrast, and rotation, and shows it outperforms baseline approaches on ImageNet and CIFAR-10 datasets.

Model robustness against adversarial examples of single perturbation type such as the $\ell_{p}$-norm has been widely studied, yet its generalization to more realistic scenarios involving multiple semantic perturbations and their composition remains largely unexplored. In this paper, we first propose a novel method for generating composite adversarial examples. Our method can find the optimal attack composition by utilizing component-wise projected gradient descent and automatic attack-order scheduling. We then propose generalized adversarial training (GAT) to extend model robustness from $\ell_{p}$-ball to composite semantic perturbations, such as the combination of Hue, Saturation, Brightness, Contrast, and Rotation. Results obtained using ImageNet and CIFAR-10 datasets indicate that GAT can be robust not only to all the tested types of a single attack, but also to any combination of such attacks. GAT also outperforms baseline $\ell_{\infty}$-norm bounded adversarial training approaches by a significant margin.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes