CVMar 3, 2020

Data-Free Adversarial Perturbations for Practical Black-Box Attack

arXiv:2003.01295v114 citations
AI Analysis

This addresses a practical security vulnerability in deep learning models for attackers in black-box scenarios, representing an incremental improvement over existing universal adversarial perturbation methods.

The paper tackles the problem of data dependence in black-box adversarial attacks by introducing a data-free method for crafting adversarial perturbations, achieving high fooling rates on target models without access to training data.

Neural networks are vulnerable to adversarial examples, which are malicious inputs crafted to fool pre-trained models. Adversarial examples often exhibit black-box attacking transferability, which allows that adversarial examples crafted for one model can fool another model. However, existing black-box attack methods require samples from the training data distribution to improve the transferability of adversarial examples across different models. Because of the data dependence, the fooling ability of adversarial perturbations is only applicable when training data are accessible. In this paper, we present a data-free method for crafting adversarial perturbations that can fool a target model without any knowledge about the training data distribution. In the practical setting of a black-box attack scenario where attackers do not have access to target models and training data, our method achieves high fooling rates on target models and outperforms other universal adversarial perturbation methods. Our method empirically shows that current deep learning models are still at risk even when the attackers do not have access to training data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes