CVAug 11, 2022

Diverse Generative Perturbations on Attention Space for Transferable Adversarial Attacks

arXiv:2208.05650v217 citationsh-index: 38Has Code
Originality Incremental advance
AI Analysis

This work addresses the challenge of creating adversarial examples that fool unknown models, which is crucial for practical security testing, though it is an incremental improvement over existing methods.

The paper tackles the problem of low transferability in adversarial attacks by proposing the Attentive-Diversity Attack (ADA), which disrupts diverse salient features stochastically to improve exploration of the loss surface, resulting in outperforming state-of-the-art methods in transferability.

Adversarial attacks with improved transferability - the ability of an adversarial example crafted on a known model to also fool unknown models - have recently received much attention due to their practicality. Nevertheless, existing transferable attacks craft perturbations in a deterministic manner and often fail to fully explore the loss surface, thus falling into a poor local optimum and suffering from low transferability. To solve this problem, we propose Attentive-Diversity Attack (ADA), which disrupts diverse salient features in a stochastic manner to improve transferability. Primarily, we perturb the image attention to disrupt universal features shared by different models. Then, to effectively avoid poor local optima, we disrupt these features in a stochastic manner and explore the search space of transferable perturbations more exhaustively. More specifically, we use a generator to produce adversarial perturbations that each disturbs features in different ways depending on an input latent code. Extensive experimental evaluations demonstrate the effectiveness of our method, outperforming the transferability of state-of-the-art methods. Codes are available at https://github.com/wkim97/ADA.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes