LGMLSep 6, 2024

Epistemic Uncertainty and Observation Noise with the Neural Tangent Kernel

arXiv:2409.03953v23 citationsh-index: 11
Originality Incremental advance
AI Analysis

This work addresses uncertainty quantification in neural networks, which is important for improving reliability in applications like regression, but it is incremental as it builds directly on existing NTK-GP equivalence.

The paper tackles the problem of handling non-zero aleatoric noise and estimating epistemic uncertainty in wide neural networks trained with gradient descent, by extending the Neural Tangent Kernel framework to include these aspects, and demonstrates proof-of-concept on synthetic regression.

Recent work has shown that training wide neural networks with gradient descent is formally equivalent to computing the mean of the posterior distribution in a Gaussian Process (GP) with the Neural Tangent Kernel (NTK) as the prior covariance and zero aleatoric noise \parencite{jacot2018neural}. In this paper, we extend this framework in two ways. First, we show how to deal with non-zero aleatoric noise. Second, we derive an estimator for the posterior covariance, giving us a handle on epistemic uncertainty. Our proposed approach integrates seamlessly with standard training pipelines, as it involves training a small number of additional predictors using gradient descent on a mean squared error loss. We demonstrate the proof-of-concept of our method through empirical evaluation on synthetic regression.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes