CVJul 31, 2021

Delving into Deep Image Prior for Adversarial Defense: A Novel Reconstruction-based Defense Framework

arXiv:2108.00180v19 citations
Originality Highly original
AI Analysis

This work addresses the problem of adversarial attacks on image classification models for AI security, offering a training-free and attack-agnostic defense that is incremental in improving reconstruction-based methods.

The paper tackles the vulnerability of deep learning image classifiers to adversarial attacks by proposing a novel reconstruction-based defense framework that leverages deep image prior (DIP) to purify adversarial noise and reconstruct images for correct classification, achieving state-of-the-art performance in defending against white-box and defense-aware attacks while maintaining high visual quality.

Deep learning based image classification models are shown vulnerable to adversarial attacks by injecting deliberately crafted noises to clean images. To defend against adversarial attacks in a training-free and attack-agnostic manner, this work proposes a novel and effective reconstruction-based defense framework by delving into deep image prior (DIP). Fundamentally different from existing reconstruction-based defenses, the proposed method analyzes and explicitly incorporates the model decision process into our defense. Given an adversarial image, firstly we map its reconstructed images during DIP optimization to the model decision space, where cross-boundary images can be detected and on-boundary images can be further localized. Then, adversarial noise is purified by perturbing on-boundary images along the reverse direction to the adversarial image. Finally, on-manifold images are stitched to construct an image that can be correctly predicted by the victim classifier. Extensive experiments demonstrate that the proposed method outperforms existing state-of-the-art reconstruction-based methods both in defending white-box attacks and defense-aware attacks. Moreover, the proposed method can maintain a high visual quality during adversarial image reconstruction.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes