LG AI MLMar 24, 2024

An Analytic Solution to Covariance Propagation in Neural Networks

Oren Wright, Yorie Nakahira, José M. F. Moura

CMU

arXiv:2403.16163v115.012 citationsh-index: 11Has CodeAISTATS

Originality Highly original

AI Analysis

This addresses the need for reliable uncertainty estimation in deep learning systems, offering a more efficient alternative to costly sampling methods.

The paper tackles the problem of uncertainty quantification in neural networks by introducing a sample-free moment propagation technique that accurately characterizes input-output distributions, achieving analytic solutions for covariance propagation through nonlinear activations like ReLU and GELU.

Uncertainty quantification of neural networks is critical to measuring the reliability and robustness of deep learning systems. However, this often involves costly or inaccurate sampling methods and approximations. This paper presents a sample-free moment propagation technique that propagates mean vectors and covariance matrices across a network to accurately characterize the input-output distributions of neural networks. A key enabler of our technique is an analytic solution for the covariance of random variables passed through nonlinear activation functions, such as Heaviside, ReLU, and GELU. The wide applicability and merits of the proposed technique are shown in experiments analyzing the input-output distributions of trained neural networks and training Bayesian neural networks.

View on arXiv PDF Code

Similar