LGCRCVNov 20, 2017

Evaluating Robustness of Neural Networks with Mixed Integer Programming

arXiv:1711.07356v3373 citations
Originality Highly original
AI Analysis

This work addresses the critical issue of neural network security for AI practitioners by providing a faster and more accurate verification tool, though it is incremental in improving existing verification techniques.

The paper tackles the problem of verifying neural network robustness against adversarial examples by formulating it as a mixed integer program, achieving a two to three orders of magnitude speedup over state-of-the-art methods and enabling the first exact determination of adversarial accuracy for an MNIST classifier, finding adversarial examples for 4.38% of samples.

Neural networks have demonstrated considerable success on a wide variety of real-world problems. However, networks trained only to optimize for training accuracy can often be fooled by adversarial examples - slightly perturbed inputs that are misclassified with high confidence. Verification of networks enables us to gauge their vulnerability to such adversarial examples. We formulate verification of piecewise-linear neural networks as a mixed integer program. On a representative task of finding minimum adversarial distortions, our verifier is two to three orders of magnitude quicker than the state-of-the-art. We achieve this computational speedup via tight formulations for non-linearities, as well as a novel presolve algorithm that makes full use of all information available. The computational speedup allows us to verify properties on convolutional networks with an order of magnitude more ReLUs than networks previously verified by any complete verifier. In particular, we determine for the first time the exact adversarial accuracy of an MNIST classifier to perturbations with bounded $l_\infty$ norm $ε=0.1$: for this classifier, we find an adversarial example for 4.38% of samples, and a certificate of robustness (to perturbations with bounded norm) for the remainder. Across all robust training procedures and network architectures considered, we are able to certify more samples than the state-of-the-art and find more adversarial examples than a strong first-order attack.

Code Implementations6 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes