LGMLJun 28, 2018

ResNet with one-neuron hidden layers is a Universal Approximator

arXiv:1806.10909v2253 citations
Originality Incremental advance
AI Analysis

This provides a theoretical foundation for the representational power of ResNets in deep learning, addressing a fundamental limitation in neural network architecture for researchers and practitioners.

The paper tackles the problem of whether narrow deep networks can universally approximate functions, showing that a very deep ResNet with one-neuron hidden layers and ReLU activations can uniformly approximate any Lebesgue integrable function in d dimensions, in contrast to fully connected networks which fail at width d.

We demonstrate that a very deep ResNet with stacked modules with one neuron per hidden layer and ReLU activation functions can uniformly approximate any Lebesgue integrable function in $d$ dimensions, i.e. $\ell_1(\mathbb{R}^d)$. Because of the identity mapping inherent to ResNets, our network has alternating layers of dimension one and $d$. This stands in sharp contrast to fully connected networks, which are not universal approximators if their width is the input dimension $d$ [Lu et al, 2017; Hanin and Sellke, 2017]. Hence, our result implies an increase in representational power for narrow deep networks by the ResNet architecture.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes