LG IV SY MLOct 29, 2024

LipKernel: Lipschitz-Bounded Convolutional Neural Networks via Dissipative Layers

Patricia Pauli, Ruigang Wang, Ian Manchester, Frank Allgöwer

arXiv:2410.22258v16.42 citationsh-index: 34Has Code

Originality Highly original

AI Analysis

This work addresses robustness for real-time perception and control in applications like robotics and autonomous vehicles, offering a more expressive and efficient approach than prior methods.

The paper tackles the problem of ensuring robustness in convolutional neural networks by proposing LipKernel, a layer-wise parameterization that enforces Lipschitz bounds via dissipative layers, resulting in run-time improvements orders of magnitude faster than state-of-the-art methods.

We propose a novel layer-wise parameterization for convolutional neural networks (CNNs) that includes built-in robustness guarantees by enforcing a prescribed Lipschitz bound. Each layer in our parameterization is designed to satisfy a linear matrix inequality (LMI), which in turn implies dissipativity with respect to a specific supply rate. Collectively, these layer-wise LMIs ensure Lipschitz boundedness for the input-output mapping of the neural network, yielding a more expressive parameterization than through spectral bounds or orthogonal layers. Our new method LipKernel directly parameterizes dissipative convolution kernels using a 2-D Roesser-type state space model. This means that the convolutional layers are given in standard form after training and can be evaluated without computational overhead. In numerical experiments, we show that the run-time using our method is orders of magnitude faster than state-of-the-art Lipschitz-bounded networks that parameterize convolutions in the Fourier domain, making our approach particularly attractive for improving robustness of learning-based real-time perception or control in robotics, autonomous vehicles, or automation systems. We focus on CNNs, and in contrast to previous works, our approach accommodates a wide variety of layers typically used in CNNs, including 1-D and 2-D convolutional layers, maximum and average pooling layers, as well as strided and dilated convolutions and zero padding. However, our approach naturally extends beyond CNNs as we can incorporate any layer that is incrementally dissipative.

View on arXiv PDF Code

Similar