LGFAJan 11, 2022

Uniform Approximation with Quadratic Neural Networks

arXiv:2201.03747v3
Originality Incremental advance
AI Analysis

This provides a theoretical foundation for the approximation capabilities of quadratic and higher-order rectified units in neural networks, which is incremental as it extends known results to this specific activation family.

The paper tackles the problem of approximating Hölder-regular functions using deep neural networks with ReQU activation, proving that they can achieve any accuracy ε with at most O(ε^{-d/2r}) neurons and a fixed number of layers.

In this work, we examine the approximation capabilities of deep neural networks utilizing the Rectified Quadratic Unit (ReQU) activation function, defined as \(\max(0,x)^2\), for approximating Hölder-regular functions with respect to the uniform norm. We constructively prove that deep neural networks with ReQU activation can approximate any function within the \(R\)-ball of \(r\)-Hölder-regular functions (\(\mathcal{H}^{r, R}([-1,1]^d)\)) up to any accuracy \(ε\) with at most \(\mathcal{O}\left(ε^{-d /2r}\right)\) neurons and fixed number of layers. This result highlights that the effectiveness of the approximation depends significantly on the smoothness of the target function and the characteristics of the ReQU activation function. Our proof is based on approximating local Taylor expansions with deep ReQU neural networks, demonstrating their ability to capture the behavior of Hölder-regular functions effectively. Furthermore, the results can be straightforwardly generalized to any Rectified Power Unit (RePU) activation function of the form \(\max(0,x)^p\) for \(p \geq 2\), indicating the broader applicability of our findings within this family of activations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes