CVCRLGNov 6, 2018

SparseFool: a few pixels make a big difference

arXiv:1811.02248v4230 citations
Originality Incremental advance
AI Analysis

This addresses security vulnerabilities in image classification systems by enabling efficient sparse attacks, though it is incremental as it builds on existing sparse perturbation concepts.

The paper tackles the problem of efficiently computing sparse adversarial perturbations for deep neural networks, proposing SparseFool, which exploits low mean curvature of decision boundaries to achieve fast computation and scalability to high-dimensional data, with adversarial training showing only slight robustness improvements.

Deep Neural Networks have achieved extraordinary results on image classification tasks, but have been shown to be vulnerable to attacks with carefully crafted perturbations of the input data. Although most attacks usually change values of many image's pixels, it has been shown that deep networks are also vulnerable to sparse alterations of the input. However, no computationally efficient method has been proposed to compute sparse perturbations. In this paper, we exploit the low mean curvature of the decision boundary, and propose SparseFool, a geometry inspired sparse attack that controls the sparsity of the perturbations. Extensive evaluations show that our approach computes sparse perturbations very fast, and scales efficiently to high dimensional data. We further analyze the transferability and the visual effects of the perturbations, and show the existence of shared semantic information across the images and the networks. Finally, we show that adversarial training can only slightly improve the robustness against sparse additive perturbations computed with SparseFool.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes