LG DS MLApr 21, 2021

Deep limits and cut-off phenomena for neural networks

arXiv:2104.10727v11.6

Originality Incremental advance

AI Analysis

This provides a theoretical framework for deep learning, particularly in geometry and dynamics, which could help in choosing network depth and initialization, though it is incremental in nature.

The paper tackles the problem of understanding deep neural networks' behavior as depth increases, showing that certain limits exist for random layer maps and observing a cut-off phenomenon in random initialization that affects depth selection.

We consider dynamical and geometrical aspects of deep learning. For many standard choices of layer maps we display semi-invariant metrics which quantify differences between data or decision functions. This allows us, when considering random layer maps and using non-commutative ergodic theorems, to deduce that certain limits exist when letting the number of layers tend to infinity. We also examine the random initialization of standard networks where we observe a surprising cut-off phenomenon in terms of the number of layers, the depth of the network. This could be a relevant parameter when choosing an appropriate number of layers for a given learning task, or for selecting a good initialization procedure. More generally, we hope that the notions and results in this paper can provide a framework, in particular a geometric one, for a part of the theoretical understanding of deep neural networks.

View on arXiv PDF

Similar