ML LGJun 19, 2020

No one-hidden-layer neural network can represent multivariable functions

Masayo Inoue, Mana Futamura, Hirokazu Ninomiya

arXiv:2006.10977v1

Originality Incremental advance

AI Analysis

This work addresses a foundational limitation in neural network theory, showing that single-hidden-layer architectures are insufficient for approximating some multivariable functions, which is incremental but clarifies theoretical boundaries.

The paper demonstrates that a one-hidden-layer neural network with ReLU activation cannot precisely represent certain smooth binary functions, while also presenting constraints on parameters and second derivatives for unary functions to improve implementation accuracy.

In a function approximation with a neural network, an input dataset is mapped to an output index by optimizing the parameters of each hidden-layer unit. For a unary function, we present constraints on the parameters and its second derivative by constructing a continuum version of a one-hidden-layer neural network with the rectified linear unit (ReLU) activation function. The network is accurately implemented because the constraints decrease the degrees of freedom of the parameters. We also explain the existence of a smooth binary function that cannot be precisely represented by any such neural network.

View on arXiv PDF

Similar