CV LG NEJun 12, 2018

Adversarial Attacks on Variational Autoencoders

George Gondim-Ribeiro, Pedro Tabacof, Eduardo Valle

arXiv:1806.04646v116.044 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses safety concerns for autoencoders used in data compression, but the work is incremental as it builds on existing adversarial attack research.

The paper tackles the problem of adversarial attacks on variational autoencoders by proposing an attack scheme and evaluation framework, showing that DRAW's recurrence and attention mechanism improve resistance across three datasets.

Adversarial attacks are malicious inputs that derail machine-learning models. We propose a scheme to attack autoencoders, as well as a quantitative evaluation framework that correlates well with the qualitative assessment of the attacks. We assess --- with statistically validated experiments --- the resistance to attacks of three variational autoencoders (simple, convolutional, and DRAW) in three datasets (MNIST, SVHN, CelebA), showing that both DRAW's recurrence and attention mechanism lead to better resistance. As autoencoders are proposed for compressing data --- a scenario in which their safety is paramount --- we expect more attention will be given to adversarial attacks on them.

View on arXiv PDF Code

Similar