CV LGNov 19, 2021

Enhanced countering adversarial attacks via input denoising and feature restoring

Yanni Li, Wenhui Zhang, Jiawei Liu, Xiaoli Kou, Hui Li, Jiangtao Cui

arXiv:2111.10075v11.4h-index: 46Has Code

Originality Incremental advance

AI Analysis

It addresses the problem of adversarial attacks for AI security, offering an incremental improvement over existing defense methods.

This paper tackles the vulnerability of deep neural networks to adversarial examples by proposing IDFR, a method combining input denoising and feature restoring, which outperforms state-of-the-art defenses on benchmark datasets.

Despite the fact that deep neural networks (DNNs) have achieved prominent performance in various applications, it is well known that DNNs are vulnerable to adversarial examples/samples (AEs) with imperceptible perturbations in clean/original samples. To overcome the weakness of the existing defense methods against adversarial attacks, which damages the information on the original samples, leading to the decrease of the target classifier accuracy, this paper presents an enhanced countering adversarial attack method IDFR (via Input Denoising and Feature Restoring). The proposed IDFR is made up of an enhanced input denoiser (ID) and a hidden lossy feature restorer (FR) based on the convex hull optimization. Extensive experiments conducted on benchmark datasets show that the proposed IDFR outperforms the various state-of-the-art defense methods, and is highly effective for protecting target models against various adversarial black-box or white-box attacks. \footnote{Souce code is released at: \href{https://github.com/ID-FR/IDFR}{https://github.com/ID-FR/IDFR}}

View on arXiv PDF Code

Similar