LGNEAGMLMay 29, 2019

On the Expressive Power of Deep Polynomial Neural Networks

arXiv:1905.12207v1108 citations
Originality Incremental advance
AI Analysis

This work provides a theoretical framework for understanding network design in machine learning, though it is incremental in advancing mathematical foundations.

The paper tackles the problem of quantifying the expressive power of deep polynomial neural networks by proposing the dimension of an algebraic variety as a measure, obtaining theoretical results on architecture dependencies and computational evidence for optimal layer width patterns.

We study deep neural networks with polynomial activations, particularly their expressive power. For a fixed architecture and activation degree, a polynomial neural network defines an algebraic map from weights to polynomials. The image of this map is the functional space associated to the network, and it is an irreducible algebraic variety upon taking closure. This paper proposes the dimension of this variety as a precise measure of the expressive power of polynomial neural networks. We obtain several theoretical results regarding this dimension as a function of architecture, including an exact formula for high activation degrees, as well as upper and lower bounds on layer widths in order for deep polynomials networks to fill the ambient functional space. We also present computational evidence that it is profitable in terms of expressiveness for layer widths to increase monotonically and then decrease monotonically. Finally, we link our study to favorable optimization properties when training weights, and we draw intriguing connections with tensor and polynomial decompositions.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes