MLLGPRSep 29, 2025

Quantitative convergence of trained single layer neural networks to Gaussian processes

arXiv:2509.24544v2h-index: 1
Originality Incremental advance
AI Analysis

This work addresses a theoretical gap for researchers in machine learning by offering precise finite-width estimates, though it is incremental as it builds on prior qualitative convergence results.

The paper tackles the problem of quantifying how shallow neural networks converge to Gaussian processes during training, providing explicit upper bounds on the approximation error that decay polynomially with network width.

In this paper, we study the quantitative convergence of shallow neural networks trained via gradient descent to their associated Gaussian processes in the infinite-width limit. While previous work has established qualitative convergence under broad settings, precise, finite-width estimates remain limited, particularly during training. We provide explicit upper bounds on the quadratic Wasserstein distance between the network output and its Gaussian approximation at any training time $t \ge 0$, demonstrating polynomial decay with network width. Our results quantify how architectural parameters, such as width and input dimension, influence convergence, and how training dynamics affect the approximation error.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes