LGNEOCMar 20, 2021

Low Dimensional Landscape Hypothesis is True: DNNs can be Trained in Tiny Subspaces

arXiv:2103.11154v211 citations
AI Analysis

This addresses the computational burden of training large DNNs for researchers and practitioners, though it is incremental as it builds on existing low-dimensional landscape hypotheses.

The paper tackles the problem of training deep neural networks (DNNs) efficiently by leveraging redundancy in parameters, showing that DNNs can be trained in low-dimensional subspaces with comparable performance to full-parameter training, achieving results in 40-dimensional spaces.

Deep neural networks (DNNs) usually contain massive parameters, but there is redundancy such that it is guessed that the DNNs could be trained in low-dimensional subspaces. In this paper, we propose a Dynamic Linear Dimensionality Reduction (DLDR) based on low-dimensional properties of the training trajectory. The reduction is efficient, which is supported by comprehensive experiments: optimization in 40 dimensional spaces can achieve comparable performance as regular training over thousands or even millions of parameters. Since there are only a few optimization variables, we develop a quasi-Newton-based algorithm and also obtain robustness against label noises, which are two follow-up experiments to show the advantages of finding low-dimensional subspaces.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes