CVAILGMLOct 6, 2023

Assessing Robustness via Score-Based Adversarial Image Generation

arXiv:2310.04285v36 citationsh-index: 12
Originality Highly original
AI Analysis

This work addresses the problem of limited robustness evaluations in machine learning for researchers and practitioners by proposing a more encompassing approach, though it is incremental in advancing adversarial generation methods.

The paper tackles the limitation of traditional adversarial attacks bounded by small $\ell_p$-norm constraints by introducing ScoreAG, a framework that uses score-based generative models to generate unrestricted adversarial examples while preserving image semantics, and it empirically improves upon most state-of-the-art attacks and defenses across multiple benchmarks.

Most adversarial attacks and defenses focus on perturbations within small $\ell_p$-norm constraints. However, $\ell_p$ threat models cannot capture all relevant semantics-preserving perturbations, and hence, the scope of robustness evaluations is limited. In this work, we introduce Score-Based Adversarial Generation (ScoreAG), a novel framework that leverages the advancements in score-based generative models to generate unrestricted adversarial examples that overcome the limitations of $\ell_p$-norm constraints. Unlike traditional methods, ScoreAG maintains the core semantics of images while generating adversarial examples, either by transforming existing images or synthesizing new ones entirely from scratch. We further exploit the generative capability of ScoreAG to purify images, empirically enhancing the robustness of classifiers. Our extensive empirical evaluation demonstrates that ScoreAG improves upon the majority of state-of-the-art attacks and defenses across multiple benchmarks. This work highlights the importance of investigating adversarial examples bounded by semantics rather than $\ell_p$-norm constraints. ScoreAG represents an important step towards more encompassing robustness assessments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes