LG CR MLDec 7, 2018

Combatting Adversarial Attacks through Denoising and Dimensionality Reduction: A Cascaded Autoencoder Approach

Rajeev Sahay, Rehana Mahfuz, Aly El Gamal

arXiv:1812.03087v18.338 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses security vulnerabilities in ML models for applications like image recognition, but it is incremental as it builds on existing autoencoder methods for defense.

The paper tackles defending machine learning models against gradient-based adversarial attacks by proposing a cascaded autoencoder pipeline for denoising and dimensionality reduction, resulting in significantly higher classifier accuracy on perturbed test data.

Machine Learning models are vulnerable to adversarial attacks that rely on perturbing the input data. This work proposes a novel strategy using Autoencoder Deep Neural Networks to defend a machine learning model against two gradient-based attacks: The Fast Gradient Sign attack and Fast Gradient attack. First we use an autoencoder to denoise the test data, which is trained with both clean and corrupted data. Then, we reduce the dimension of the denoised data using the hidden layer representation of another autoencoder. We perform this experiment for multiple values of the bound of adversarial perturbations, and consider different numbers of reduced dimensions. When the test data is preprocessed using this cascaded pipeline, the tested deep neural network classifier yields a much higher accuracy, thus mitigating the effect of the adversarial perturbation.

View on arXiv PDF Code

Similar