AILGFeb 22, 2018

L2-Nonexpansive Neural Networks

arXiv:1802.07896v475 citations
Originality Highly original
AI Analysis

This addresses robustness issues in neural networks for security-critical applications, representing a novel method rather than incremental improvement.

The paper tackles the problem of creating robust neural networks by proposing L2-nonexpansive networks that limit output changes to input changes, achieving state-of-the-art robustness against white-box L2-bounded adversarial attacks on MNIST and CIFAR-10 without adversarial training.

This paper proposes a class of well-conditioned neural networks in which a unit amount of change in the inputs causes at most a unit amount of change in the outputs or any of the internal layers. We develop the known methodology of controlling Lipschitz constants to realize its full potential in maximizing robustness, with a new regularization scheme for linear layers, new ways to adapt nonlinearities and a new loss function. With MNIST and CIFAR-10 classifiers, we demonstrate a number of advantages. Without needing any adversarial training, the proposed classifiers exceed the state of the art in robustness against white-box L2-bounded adversarial attacks. They generalize better than ordinary networks from noisy data with partially random labels. Their outputs are quantitatively meaningful and indicate levels of confidence and generalization, among other desirable properties.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes