CVSOC-PHJun 18, 2021

Analyzing Adversarial Robustness of Deep Neural Networks in Pixel Space: a Semantic Perspective

arXiv:2106.09872v1
Originality Incremental advance
AI Analysis

This work addresses security concerns for image classifiers by revealing region-specific vulnerabilities, but it is incremental as it builds on existing adversarial attack methods.

The paper tackles the problem of adversarial robustness in deep neural networks by analyzing pixel-level vulnerabilities from a semantic perspective, showing that one-pixel attacks can succeed by targeting specific regions rather than the entire image, with experimental verification on CIFAR-10 and ImageNet.

The vulnerability of deep neural networks to adversarial examples, which are crafted maliciously by modifying the inputs with imperceptible perturbations to misled the network produce incorrect outputs, reveals the lack of robustness and poses security concerns. Previous works study the adversarial robustness of image classifiers on image level and use all the pixel information in an image indiscriminately, lacking of exploration of regions with different semantic meanings in the pixel space of an image. In this work, we fill this gap and explore the pixel space of the adversarial image by proposing an algorithm to looking for possible perturbations pixel by pixel in different regions of the segmented image. The extensive experimental results on CIFAR-10 and ImageNet verify that searching for the modified pixel in only some pixels of an image can successfully launch the one-pixel adversarial attacks without requiring all the pixels of the entire image, and there exist multiple vulnerable points scattered in different regions of an image. We also demonstrate that the adversarial robustness of different regions on the image varies with the amount of semantic information contained.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes