ML LGMay 28, 2018

Lipschitz regularity of deep neural networks: analysis and efficient estimation

arXiv:1805.10965v240.4690 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the need for safe and robust neural network applications by providing more precise Lipschitz estimations, though it is incremental as it builds on previous estimation methods.

The paper tackles the problem of efficiently estimating the Lipschitz constant of deep neural networks to assess their robustness to perturbations, showing that exact computation is NP-hard and proposing AutoLip and SeqLip algorithms that significantly improve upper bounds in experiments.

Deep neural networks are notorious for being sensitive to small well-chosen perturbations, and estimating the regularity of such architectures is of utmost importance for safe and robust practical applications. In this paper, we investigate one of the key characteristics to assess the regularity of such methods: the Lipschitz constant of deep learning architectures. First, we show that, even for two layer neural networks, the exact computation of this quantity is NP-hard and state-of-art methods may significantly overestimate it. Then, we both extend and improve previous estimation methods by providing AutoLip, the first generic algorithm for upper bounding the Lipschitz constant of any automatically differentiable function. We provide a power method algorithm working with automatic differentiation, allowing efficient computations even on large convolutions. Second, for sequential neural networks, we propose an improved algorithm named SeqLip that takes advantage of the linear computation graph to split the computation per pair of consecutive layers. Third we propose heuristics on SeqLip in order to tackle very large networks. Our experiments show that SeqLip can significantly improve on the existing upper bounds. Finally, we provide an implementation of AutoLip in the PyTorch environment that may be used to better estimate the robustness of a given neural network to small perturbations or regularize it using more precise Lipschitz estimations.

View on arXiv PDF Code

Similar