LGJan 28, 2021

Improving Neural Network Robustness through Neighborhood Preserving Layers

arXiv:2101.11766v26 citations
Originality Incremental advance
AI Analysis

This work addresses robustness issues in neural networks for security-critical applications, but it appears incremental as it modifies existing layers rather than introducing a new paradigm.

The paper tackles the problem of neural network vulnerability to adversarial attacks by proposing a new neighborhood preserving layer to replace overparameterized fully-connected layers, resulting in improved robustness against gradient-based attacks on MNIST and CIFAR10 datasets.

Robustness against adversarial attack in neural networks is an important research topic in the machine learning community. We observe one major source of vulnerability of neural nets is from overparameterized fully-connected layers. In this paper, we propose a new neighborhood preserving layer which can replace these fully connected layers to improve the network robustness. We demonstrate a novel neural network architecture which can incorporate such layers and also can be trained efficiently. We theoretically prove that our models are more robust against distortion because they effectively control the magnitude of gradients. Finally, we empirically show that our designed network architecture is more robust against state-of-art gradient descent based attacks, such as a PGD attack on the benchmark datasets MNIST and CIFAR10.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes