CVMar 3, 2020

Data-Free Adversarial Perturbations for Practical Black-Box Attack

ZhaoXin Huan, Yulong Wang, Xiaolu Zhang, Lin Shang, Chilin Fu, Jun Zhou

arXiv:2003.01295v15.014 citations

Originality Incremental advance

AI Analysis

This addresses a practical security vulnerability in deep learning models for attackers in black-box scenarios, representing an incremental improvement over existing universal adversarial perturbation methods.

The paper tackles the problem of data dependence in black-box adversarial attacks by introducing a data-free method for crafting adversarial perturbations, achieving high fooling rates on target models without access to training data.

Neural networks are vulnerable to adversarial examples, which are malicious inputs crafted to fool pre-trained models. Adversarial examples often exhibit black-box attacking transferability, which allows that adversarial examples crafted for one model can fool another model. However, existing black-box attack methods require samples from the training data distribution to improve the transferability of adversarial examples across different models. Because of the data dependence, the fooling ability of adversarial perturbations is only applicable when training data are accessible. In this paper, we present a data-free method for crafting adversarial perturbations that can fool a target model without any knowledge about the training data distribution. In the practical setting of a black-box attack scenario where attackers do not have access to target models and training data, our method achieves high fooling rates on target models and outperforms other universal adversarial perturbation methods. Our method empirically shows that current deep learning models are still at risk even when the attackers do not have access to training data.

View on arXiv PDF

Similar