LGCRMLOct 28, 2020

Most ReLU Networks Suffer from $\ell^2$ Adversarial Perturbations

arXiv:2010.14927v117 citations
Originality Incremental advance
AI Analysis

This addresses the problem of adversarial vulnerability in neural networks for researchers and practitioners in machine learning security, offering a theoretical explanation for a common phenomenon.

The paper shows that for most ReLU networks with random weights and decreasing dimensions, most input examples have adversarial perturbations at a Euclidean distance of O(‖x‖/√d), where d is the input dimension, and these can be found via gradient flow or descent. This provides an explanation for the abundance of adversarial examples and their discovery through gradient-based methods.

We consider ReLU networks with random weights, in which the dimension decreases at each layer. We show that for most such networks, most examples $x$ admit an adversarial perturbation at an Euclidean distance of $O\left(\frac{\|x\|}{\sqrt{d}}\right)$, where $d$ is the input dimension. Moreover, this perturbation can be found via gradient flow, as well as gradient descent with sufficiently small steps. This result can be seen as an explanation to the abundance of adversarial examples, and to the fact that they are found via gradient descent.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes