CVMar 9, 2022

Practical No-box Adversarial Attacks with Training-free Hybrid Image Transformation

Qilong Zhang, Youheng Sun, Chaoning Zhang, Chaoqun Li, Xuanhan Wang, Jingkuan Song, Lianli Gao

arXiv:2203.04607v310.117 citationsh-index: 61

Originality Highly original

AI Analysis

This addresses a practical security vulnerability in AI systems by enabling efficient, real-time attacks without model access, though it is incremental over existing no-box methods.

The paper tackles the problem of no-box adversarial attacks on deep neural networks, where attackers have no knowledge of the target model or dataset, by proposing a training-free method that manipulates high-frequency components in images, achieving an average success rate of 98.13% on ten models, outperforming prior no-box attacks by 29.39%.

In recent years, the adversarial vulnerability of deep neural networks (DNNs) has raised increasing attention. Among all the threat models, no-box attacks are the most practical but extremely challenging since they neither rely on any knowledge of the target model or similar substitute model, nor access the dataset for training a new substitute model. Although a recent method has attempted such an attack in a loose sense, its performance is not good enough and computational overhead of training is expensive. In this paper, we move a step forward and show the existence of a \textbf{training-free} adversarial perturbation under the no-box threat model, which can be successfully used to attack different DNNs in real-time. Motivated by our observation that high-frequency component (HFC) domains in low-level features and plays a crucial role in classification, we attack an image mainly by manipulating its frequency components. Specifically, the perturbation is manipulated by suppression of the original HFC and adding of noisy HFC. We empirically and experimentally analyze the requirements of effective noisy HFC and show that it should be regionally homogeneous, repeating and dense. Extensive experiments on the ImageNet dataset demonstrate the effectiveness of our proposed no-box method. It attacks ten well-known models with a success rate of \textbf{98.13\%} on average, which outperforms state-of-the-art no-box attacks by \textbf{29.39\%}. Furthermore, our method is even competitive to mainstream transfer-based black-box attacks.

View on arXiv PDF

Similar