LG ML Jul 6, 2021

Deep Network Approximation: Achieving Arbitrary Accuracy with Fixed Number of Neurons

arXiv:2107.02397v722.062 citations

Originality: Highly original

AI Analysis

By showing that the hypothesis space generated by these networks is dense in the space of continuous functions, the paper provides a theoretical foundation for supervised learning and regression.

This paper addresses the approximation of continuous functions by neural networks. It constructs simple feed-forward networks that achieve universal approximation with a fixed, finite number of neurons, and proves that networks of width 36d(2d+1) and depth 11 can approximate any continuous function on a d-dimensional hypercube to within an arbitrarily small error.
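To make the stated size concrete, the following minimal sketch evaluates the width bound 36d(2d+1) for a few input dimensions. The function name `network_size` is hypothetical and chosen only for illustration; the formula and the depth of 11 are taken from the summary above.

```python
def network_size(d: int) -> tuple:
    """Return (width, depth) from the stated bound for input dimension d."""
    if d < 1:
        raise ValueError("d must be a positive integer")
    width = 36 * d * (2 * d + 1)  # width bound quoted in the summary
    depth = 11                    # depth quoted for the approximation result
    return width, depth


if __name__ == "__main__":
    for d in (1, 2, 10):
        width, depth = network_size(d)
        print(f"d={d}: width={width}, depth={depth}")
```

The width grows quadratically in d, while the depth stays constant regardless of the target accuracy.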

This paper develops simple feed-forward neural networks that achieve the universal approximation property for all continuous functions with a fixed finite number of neurons. These neural networks are simple because they are designed with a simple, computable, and continuous activation function $\sigma$ leveraging a triangular-wave function and the softsign function. We first prove that $\sigma$-activated networks with width $36d(2d+1)$ and depth $11$ can approximate any continuous function on a $d$-dimensional hypercube within an arbitrarily small error. Hence, for supervised learning and its related regression problems, the hypothesis space generated by these networks with a size not smaller than $36d(2d+1)\times 11$ is dense in the continuous function space $C([a,b]^d)$ and therefore dense in the Lebesgue spaces $L^p([a,b]^d)$ for $p\in [1,\infty)$. Furthermore, we show that classification functions arising from image and signal classification are in the hypothesis space generated by $\sigma$-activated networks with width $36d(2d+1)$ and depth $12$ when there exist pairwise disjoint bounded closed subsets of $\mathbb{R}^d$ such that the samples of the same class are located in the same subset. Finally, we use numerical experimentation to show that replacing the rectified linear unit (ReLU) activation function by ours would improve the experiment results.
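The abstract says only that the activation combines a triangular-wave function with the softsign function. As a rough illustration, the sketch below glues a period-2 triangular wave (for non-negative inputs) to softsign (for negative inputs). This particular split at zero, and the names `triangle_wave`, `softsign`, and `sigma`, are assumptions made for this example and should be checked against the paper's own definition.

```python
import math


def triangle_wave(x: float) -> float:
    """Period-2 triangular wave with values in [0, 1]: 0 at even integers, 1 at odd ones."""
    return abs(x - 2.0 * math.floor((x + 1.0) / 2.0))


def softsign(x: float) -> float:
    """Softsign: x / (1 + |x|), a smooth map onto (-1, 1)."""
    return x / (1.0 + abs(x))


def sigma(x: float) -> float:
    """Assumed combination: triangular wave for x >= 0, softsign for x < 0."""
    # Both pieces equal 0 at x = 0, so this combination is continuous there.
    return triangle_wave(x) if x >= 0 else softsign(x)


if __name__ == "__main__":
    for x in (-3.0, -1.0, 0.0, 0.5, 1.0, 1.5, 2.0):
        print(x, sigma(x))
```

Under this assumed form, the function is cheap to evaluate and bounded in (-1, 1], with the periodic branch supplying oscillation and the softsign branch supplying a smooth, monotone part.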
