LG CR MLAug 20, 2018

Stochastic Combinatorial Ensembles for Defending Against Adversarial Examples

George A. Adam, Petr Smirnov, David Duvenaud, Benjamin Haibe-Kains, Anna Goldenberg

arXiv:1808.06645v22.910 citations

Originality Incremental advance

AI Analysis

This work addresses the problem of adversarial attacks in deep learning for security-critical applications, offering a novel defense method with incremental improvements over existing approaches.

The paper tackles the vulnerability of deep learning models to adversarial examples by introducing a probabilistic framework that generates large ensembles from a single model at linear cost, using stochastic noise removal operators like VAEs between layers, and demonstrates reduced adversarial transferability with empirical results on model gradients.

Many deep learning algorithms can be easily fooled with simple adversarial examples. To address the limitations of existing defenses, we devised a probabilistic framework that can generate an exponentially large ensemble of models from a single model with just a linear cost. This framework takes advantage of neural network depth and stochastically decides whether or not to insert noise removal operators such as VAEs between layers. We show empirically the important role that model gradients have when it comes to determining transferability of adversarial examples, and take advantage of this result to demonstrate that it is possible to train models with limited adversarial attack transferability. Additionally, we propose a detection method based on metric learning in order to detect adversarial examples that have no hope of being cleaned of maliciously engineered noise.

View on arXiv PDF

Similar