LGNov 12, 2022

On the High Symmetry of Neural Network Functions

arXiv:2211.06603v11 citationsh-index: 2
Originality Synthesis-oriented
AI Analysis

This provides a foundational mathematical analysis of symmetry in neural networks, which is incremental as it formalizes a known result with new estimates.

The paper tackles the problem of understanding the symmetry in neural network functions, showing that the number of equivalent minima grows factorially with the number of neurons or filters, which has implications for training convergence.

Training neural networks means solving a high-dimensional optimization problem. Normally the goal is to minimize a loss function that depends on what is called the network function, or in other words the function that gives the network output given a certain input. This function depends on a large number of parameters, also known as weights, that depends on the network architecture. In general the goal of this optimization problem is to find the global minimum of the network function. In this paper it is discussed how due to how neural networks are designed, the neural network function present a very large symmetry in the parameter space. This work shows how the neural network function has a number of equivalent minima, in other words minima that give the same value for the loss function and the same exact output, that grows factorially with the number of neurons in each layer for feed forward neural network or with the number of filters in a convolutional neural networks. When the number of neurons and layers is large, the number of equivalent minima grows extremely fast. This will have of course consequences for the study of how neural networks converges to minima during training. This results is known, but in this paper for the first time a proper mathematical discussion is presented and an estimate of the number of equivalent minima is derived.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes