MLLGPRJan 12, 2022

Eigenvalue Distribution of Large Random Matrices Arising in Deep Neural Networks: Orthogonal Case

arXiv:2201.04543v27 citations
Originality Synthesis-oriented
AI Analysis

This provides theoretical justification for eigenvalue behavior in orthogonal weight neural networks, which is incremental as it builds on existing proofs for Gaussian weights.

The paper rigorously proves that the singular value distribution of the Jacobian in deep untrained neural networks with Haar-distributed orthogonal weight matrices and Gaussian biases matches a known random matrix theory case in the infinite-width limit, extending prior work from Gaussian to orthogonal weights.

The paper deals with the distribution of singular values of the input-output Jacobian of deep untrained neural networks in the limit of their infinite width. The Jacobian is the product of random matrices where the independent rectangular weight matrices alternate with diagonal matrices whose entries depend on the corresponding column of the nearest neighbor weight matrix. The problem was considered in \cite{Pe-Co:18} for the Gaussian weights and biases and also for the weights that are Haar distributed orthogonal matrices and Gaussian biases. Basing on a free probability argument, it was claimed that in these cases the singular value distribution of the Jacobian in the limit of infinite width (matrix size) coincides with that of the analog of the Jacobian with special random but weight independent diagonal matrices, the case well known in random matrix theory. The claim was rigorously proved in \cite{Pa-Sl:21} for a quite general class of weights and biases with i.i.d. (including Gaussian) entries by using a version of the techniques of random matrix theory. In this paper we use another version of the techniques to justify the claim for random Haar distributed weight matrices and Gaussian biases.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes