LG MLSep 6, 2024

Epistemic Uncertainty and Observation Noise with the Neural Tangent Kernel

Sergio Calvo-Ordoñez, Konstantina Palla, Kamil Ciosek

arXiv:2409.03953v27.93 citationsh-index: 11

Originality Incremental advance

AI Analysis

This work addresses uncertainty quantification in neural networks, which is important for improving reliability in applications like regression, but it is incremental as it builds directly on existing NTK-GP equivalence.

The paper tackles the problem of handling non-zero aleatoric noise and estimating epistemic uncertainty in wide neural networks trained with gradient descent, by extending the Neural Tangent Kernel framework to include these aspects, and demonstrates proof-of-concept on synthetic regression.

Recent work has shown that training wide neural networks with gradient descent is formally equivalent to computing the mean of the posterior distribution in a Gaussian Process (GP) with the Neural Tangent Kernel (NTK) as the prior covariance and zero aleatoric noise \parencite{jacot2018neural}. In this paper, we extend this framework in two ways. First, we show how to deal with non-zero aleatoric noise. Second, we derive an estimator for the posterior covariance, giving us a handle on epistemic uncertainty. Our proposed approach integrates seamlessly with standard training pipelines, as it involves training a small number of additional predictors using gradient descent on a mean squared error loss. We demonstrate the proof-of-concept of our method through empirical evaluation on synthetic regression.

View on arXiv PDF

Similar