CVDec 7, 2018

Adversarial Defense of Image Classification Using a Variational Auto-Encoder

arXiv:1812.02891v112 citations
Originality Incremental advance
AI Analysis

This addresses robustness issues in security-sensitive applications, but it is incremental as it builds on existing VAE methods for defense.

The paper tackles the vulnerability of deep neural networks to adversarial attacks in image classification by proposing a variational auto-encoder (VAE) defense, which outperforms or matches JPEG compression for moderate to severe attacks.

Deep neural networks are known to be vulnerable to adversarial attacks. This exposes them to potential exploits in security-sensitive applications and highlights their lack of robustness. This paper uses a variational auto-encoder (VAE) to defend against adversarial attacks for image classification tasks. This VAE defense has a few nice properties: (1) it is quite flexible and its use of randomness makes it harder to attack; (2) it can learn disentangled representations that prevent blurry reconstruction; and (3) a patch-wise VAE defense strategy is used that does not require retraining for different size images. For moderate to severe attacks, this system outperforms or closely matches the performance of JPEG compression, with the best quality parameter. It also has more flexibility and potential for improvement via training.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes