LGAIMLMar 24, 2024

An Analytic Solution to Covariance Propagation in Neural Networks

CMU
arXiv:2403.16163v112 citationsh-index: 11AISTATS
Originality Highly original
AI Analysis

This addresses the need for reliable uncertainty estimation in deep learning systems, offering a more efficient alternative to costly sampling methods.

The paper tackles the problem of uncertainty quantification in neural networks by introducing a sample-free moment propagation technique that accurately characterizes input-output distributions, achieving analytic solutions for covariance propagation through nonlinear activations like ReLU and GELU.

Uncertainty quantification of neural networks is critical to measuring the reliability and robustness of deep learning systems. However, this often involves costly or inaccurate sampling methods and approximations. This paper presents a sample-free moment propagation technique that propagates mean vectors and covariance matrices across a network to accurately characterize the input-output distributions of neural networks. A key enabler of our technique is an analytic solution for the covariance of random variables passed through nonlinear activation functions, such as Heaviside, ReLU, and GELU. The wide applicability and merits of the proposed technique are shown in experiments analyzing the input-output distributions of trained neural networks and training Bayesian neural networks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes