LGOCPRMLApr 20, 2023

Approximation and interpolation of deep neural networks

arXiv:2304.10552v24 citationsh-index: 8
Originality Incremental advance
AI Analysis

This provides theoretical guarantees for deep learning practitioners on network expressiveness and optimization landscapes, though it is incremental as it builds on existing approximation theory.

The paper proves that overparametrized deep neural networks can universally approximate and interpolate any dataset given non-affine, locally L^1 activation functions, and characterizes the Hessian of the loss at interpolation points as forming a manifold under smoothness conditions.

In this paper, we prove that in the overparametrized regime, deep neural network provide universal approximations and can interpolate any data set, as long as the activation function is locally in $L^1(\RR)$ and not an affine function. Additionally, if the activation function is smooth and such an interpolation networks exists, then the set of parameters which interpolate forms a manifold. Furthermore, we give a characterization of the Hessian of the loss function evaluated at the interpolation points. In the last section, we provide a practical probabilistic method of finding such a point under general conditions on the activation function.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes