MLLGJun 19, 2020

No one-hidden-layer neural network can represent multivariable functions

arXiv:2006.10977v1
Originality Incremental advance
AI Analysis

This work addresses a foundational limitation in neural network theory, showing that single-hidden-layer architectures are insufficient for approximating some multivariable functions, which is incremental but clarifies theoretical boundaries.

The paper demonstrates that a one-hidden-layer neural network with ReLU activation cannot precisely represent certain smooth binary functions, while also presenting constraints on parameters and second derivatives for unary functions to improve implementation accuracy.

In a function approximation with a neural network, an input dataset is mapped to an output index by optimizing the parameters of each hidden-layer unit. For a unary function, we present constraints on the parameters and its second derivative by constructing a continuum version of a one-hidden-layer neural network with the rectified linear unit (ReLU) activation function. The network is accurately implemented because the constraints decrease the degrees of freedom of the parameters. We also explain the existence of a smooth binary function that cannot be precisely represented by any such neural network.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes