LGOct 28, 2022

Improving Lipschitz-Constrained Neural Networks by Learning Activation Functions

Stanislas Ducotterd, Alexis Goujon, Pakshal Bohra, Dimitris Perdios, Sebastian Neumayer, Michael Unser

arXiv:2210.16222v217.729 citationsh-index: 104Has Code

Originality Incremental advance

AI Analysis

This work addresses a bottleneck in robust deep learning for applications requiring stability, though it is incremental as it builds on known expressive architectures.

The paper tackles the poor performance of Lipschitz-constrained neural networks with ReLU activations by showing that networks with learnable 1-Lipschitz linear splines are optimal under a constrained functional optimization problem and proposing an efficient training method, with numerical experiments showing favorable comparisons to existing architectures.

Lipschitz-constrained neural networks have several advantages over unconstrained ones and can be applied to a variety of problems, making them a topic of attention in the deep learning community. Unfortunately, it has been shown both theoretically and empirically that they perform poorly when equipped with ReLU activation functions. By contrast, neural networks with learnable 1-Lipschitz linear splines are known to be more expressive. In this paper, we show that such networks correspond to global optima of a constrained functional optimization problem that consists of the training of a neural network composed of 1-Lipschitz linear layers and 1-Lipschitz freeform activation functions with second-order total-variation regularization. Further, we propose an efficient method to train these neural networks. Our numerical experiments show that our trained networks compare favorably with existing 1-Lipschitz neural architectures.

View on arXiv PDF Code

Similar