CVLGNov 19, 2021

Enhanced countering adversarial attacks via input denoising and feature restoring

arXiv:2111.10075v1Has Code
Originality Incremental advance
AI Analysis

It addresses the problem of adversarial attacks for AI security, offering an incremental improvement over existing defense methods.

This paper tackles the vulnerability of deep neural networks to adversarial examples by proposing IDFR, a method combining input denoising and feature restoring, which outperforms state-of-the-art defenses on benchmark datasets.

Despite the fact that deep neural networks (DNNs) have achieved prominent performance in various applications, it is well known that DNNs are vulnerable to adversarial examples/samples (AEs) with imperceptible perturbations in clean/original samples. To overcome the weakness of the existing defense methods against adversarial attacks, which damages the information on the original samples, leading to the decrease of the target classifier accuracy, this paper presents an enhanced countering adversarial attack method IDFR (via Input Denoising and Feature Restoring). The proposed IDFR is made up of an enhanced input denoiser (ID) and a hidden lossy feature restorer (FR) based on the convex hull optimization. Extensive experiments conducted on benchmark datasets show that the proposed IDFR outperforms the various state-of-the-art defense methods, and is highly effective for protecting target models against various adversarial black-box or white-box attacks. \footnote{Souce code is released at: \href{https://github.com/ID-FR/IDFR}{https://github.com/ID-FR/IDFR}}

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes