ML CR CV LGJun 17, 2021

Localized Uncertainty Attacks

Ousmane Amadou Dia, Theofanis Karaletsos, Caner Hazirbas, Cristian Canton Ferrer, Ilknur Kaynar Kabul, Erik Meijer

arXiv:2106.09222v15.02 citations

Originality Incremental advance

AI Analysis

This work addresses the challenge of creating more imperceptible adversarial examples for security and robustness in machine learning, representing an incremental improvement over existing attack methods.

The paper tackles the problem of adversarial attacks on deep learning models by introducing localized uncertainty attacks, which perturb only uncertain regions of inputs, resulting in adversarial examples that are less perceptible and retain greater similarity to original inputs.

The susceptibility of deep learning models to adversarial perturbations has stirred renewed attention in adversarial examples resulting in a number of attacks. However, most of these attacks fail to encompass a large spectrum of adversarial perturbations that are imperceptible to humans. In this paper, we present localized uncertainty attacks, a novel class of threat models against deterministic and stochastic classifiers. Under this threat model, we create adversarial examples by perturbing only regions in the inputs where a classifier is uncertain. To find such regions, we utilize the predictive uncertainty of the classifier when the classifier is stochastic or, we learn a surrogate model to amortize the uncertainty when it is deterministic. Unlike $\ell_p$ ball or functional attacks which perturb inputs indiscriminately, our targeted changes can be less perceptible. When considered under our threat model, these attacks still produce strong adversarial examples; with the examples retaining a greater degree of similarity with the inputs.

View on arXiv PDF

Similar