CV CRJun 5, 2024

ZeroPur: Succinct Training-Free Adversarial Purification

Erhu Liu, Zonglin Yang, Bo Liu, Bin Xiao, Xiuli Bi

arXiv:2406.03143v33.71 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the need for efficient and domain-independent adversarial defense for machine learning practitioners, though it is incremental as it builds on existing purification concepts.

The paper tackles the problem of defending against adversarial attacks without retraining models by proposing ZeroPur, a training-free adversarial purification method that shifts and projects adversarial images back to the natural image manifold, achieving state-of-the-art robust performance on datasets like CIFAR-10, CIFAR-100, and ImageNet-1K.

Adversarial purification is a kind of defense technique that can defend against various unseen adversarial attacks without modifying the victim classifier. Existing methods often depend on external generative models or cooperation between auxiliary functions and victim classifiers. However, retraining generative models, auxiliary functions, or victim classifiers relies on the domain of the fine-tuned dataset and is computation-consuming. In this work, we suppose that adversarial images are outliers of the natural image manifold, and the purification process can be considered as returning them to this manifold. Following this assumption, we present a simple adversarial purification method without further training to purify adversarial images, called ZeroPur. ZeroPur contains two steps: given an adversarial example, Guided Shift obtains the shifted embedding of the adversarial example by the guidance of its blurred counterparts; after that, Adaptive Projection constructs a directional vector by this shifted embedding to provide momentum, projecting adversarial images onto the manifold adaptively. ZeroPur is independent of external models and requires no retraining of victim classifiers or auxiliary functions, relying solely on victim classifiers themselves to achieve purification. Extensive experiments on three datasets (CIFAR-10, CIFAR-100, and ImageNet-1K) using various classifier architectures (ResNet, WideResNet) demonstrate that our method achieves state-of-the-art robust performance. The code will be publicly available.

View on arXiv PDF Code

Similar