CVLGNEJun 12, 2018

Adversarial Attacks on Variational Autoencoders

arXiv:1806.04646v144 citations
Originality Incremental advance
AI Analysis

This addresses safety concerns for autoencoders used in data compression, but the work is incremental as it builds on existing adversarial attack research.

The paper tackles the problem of adversarial attacks on variational autoencoders by proposing an attack scheme and evaluation framework, showing that DRAW's recurrence and attention mechanism improve resistance across three datasets.

Adversarial attacks are malicious inputs that derail machine-learning models. We propose a scheme to attack autoencoders, as well as a quantitative evaluation framework that correlates well with the qualitative assessment of the attacks. We assess --- with statistically validated experiments --- the resistance to attacks of three variational autoencoders (simple, convolutional, and DRAW) in three datasets (MNIST, SVHN, CelebA), showing that both DRAW's recurrence and attention mechanism lead to better resistance. As autoencoders are proposed for compressing data --- a scenario in which their safety is paramount --- we expect more attention will be given to adversarial attacks on them.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes