MLLGOct 5, 2017

Porcupine Neural Networks: (Almost) All Local Optima are Global

arXiv:1710.02196v161 citations
Originality Highly original
AI Analysis

This addresses the optimization difficulty in neural networks for machine learning practitioners, offering a theoretical framework to mitigate local optima issues, though it is incremental as it focuses on two-layer networks.

The paper tackles the non-convex optimization challenge in neural networks by introducing Porcupine Neural Networks (PNNs), which constrain weights to a finite set of lines, showing that most local optima are global and that unconstrained networks can be approximated with polynomially-large PNNs.

Neural networks have been used prominently in several machine learning and statistics applications. In general, the underlying optimization of neural networks is non-convex which makes their performance analysis challenging. In this paper, we take a novel approach to this problem by asking whether one can constrain neural network weights to make its optimization landscape have good theoretical properties while at the same time, be a good approximation for the unconstrained one. For two-layer neural networks, we provide affirmative answers to these questions by introducing Porcupine Neural Networks (PNNs) whose weight vectors are constrained to lie over a finite set of lines. We show that most local optima of PNN optimizations are global while we have a characterization of regions where bad local optimizers may exist. Moreover, our theoretical and empirical results suggest that an unconstrained neural network can be approximated using a polynomially-large PNN.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes