LG NE OCMar 20, 2021

Low Dimensional Landscape Hypothesis is True: DNNs can be Trained in Tiny Subspaces

Tao Li, Lei Tan, Qinghua Tao, Yipeng Liu, Xiaolin Huang

arXiv:2103.11154v28.411 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the computational burden of training large DNNs for researchers and practitioners, though it is incremental as it builds on existing low-dimensional landscape hypotheses.

The paper tackles the problem of training deep neural networks (DNNs) efficiently by leveraging redundancy in parameters, showing that DNNs can be trained in low-dimensional subspaces with comparable performance to full-parameter training, achieving results in 40-dimensional spaces.

Deep neural networks (DNNs) usually contain massive parameters, but there is redundancy such that it is guessed that the DNNs could be trained in low-dimensional subspaces. In this paper, we propose a Dynamic Linear Dimensionality Reduction (DLDR) based on low-dimensional properties of the training trajectory. The reduction is efficient, which is supported by comprehensive experiments: optimization in 40 dimensional spaces can achieve comparable performance as regular training over thousands or even millions of parameters. Since there are only a few optimization variables, we develop a quasi-Newton-based algorithm and also obtain robustness against label noises, which are two follow-up experiments to show the advantages of finding low-dimensional subspaces.

View on arXiv PDF Code

Similar