CVMar 9, 2022

Practical No-box Adversarial Attacks with Training-free Hybrid Image Transformation

arXiv:2203.04607v317 citationsh-index: 61
Originality Highly original
AI Analysis

This addresses a practical security vulnerability in AI systems by enabling efficient, real-time attacks without model access, though it is incremental over existing no-box methods.

The paper tackles the problem of no-box adversarial attacks on deep neural networks, where attackers have no knowledge of the target model or dataset, by proposing a training-free method that manipulates high-frequency components in images, achieving an average success rate of 98.13% on ten models, outperforming prior no-box attacks by 29.39%.

In recent years, the adversarial vulnerability of deep neural networks (DNNs) has raised increasing attention. Among all the threat models, no-box attacks are the most practical but extremely challenging since they neither rely on any knowledge of the target model or similar substitute model, nor access the dataset for training a new substitute model. Although a recent method has attempted such an attack in a loose sense, its performance is not good enough and computational overhead of training is expensive. In this paper, we move a step forward and show the existence of a \textbf{training-free} adversarial perturbation under the no-box threat model, which can be successfully used to attack different DNNs in real-time. Motivated by our observation that high-frequency component (HFC) domains in low-level features and plays a crucial role in classification, we attack an image mainly by manipulating its frequency components. Specifically, the perturbation is manipulated by suppression of the original HFC and adding of noisy HFC. We empirically and experimentally analyze the requirements of effective noisy HFC and show that it should be regionally homogeneous, repeating and dense. Extensive experiments on the ImageNet dataset demonstrate the effectiveness of our proposed no-box method. It attacks ten well-known models with a success rate of \textbf{98.13\%} on average, which outperforms state-of-the-art no-box attacks by \textbf{29.39\%}. Furthermore, our method is even competitive to mainstream transfer-based black-box attacks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes