MLLGJul 1, 2021

On the Expected Complexity of Maxout Networks

arXiv:2107.00379v218 citations
Originality Incremental advance
AI Analysis

This work provides incremental insights into the complexity of neural networks, relevant for researchers in machine learning theory.

The authors studied the expected complexity of maxout neural networks, showing that their practical complexity is often far from the theoretical maximum, similar to ReLU networks, and obtained nontrivial lower bounds on this complexity.

Learning with neural networks relies on the complexity of the representable functions, but more importantly, the particular assignment of typical parameters to functions of different complexity. Taking the number of activation regions as a complexity measure, recent works have shown that the practical complexity of deep ReLU networks is often far from the theoretical maximum. In this work, we show that this phenomenon also occurs in networks with maxout (multi-argument) activation functions and when considering the decision boundaries in classification tasks. We also show that the parameter space has a multitude of full-dimensional regions with widely different complexity, and obtain nontrivial lower bounds on the expected complexity. Finally, we investigate different parameter initialization procedures and show that they can increase the speed of convergence in training.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes