LGMLApr 25, 2020

Memory-efficient training with streaming dimensionality reduction

arXiv:2004.12041v11 citations
Originality Incremental advance
AI Analysis

This addresses memory efficiency challenges for deep learning practitioners, though it appears incremental as it builds on existing dimensionality reduction techniques.

The paper tackles the problem of high memory overhead from moving large data during deep neural network training by introducing streaming batch principal component analysis to generate low-rank gradient approximations. They demonstrate this method can train convolutional neural networks on common datasets with performance comparable to standard mini-batch gradient descent.

The movement of large quantities of data during the training of a Deep Neural Network presents immense challenges for machine learning workloads. To minimize this overhead, especially on the movement and calculation of gradient information, we introduce streaming batch principal component analysis as an update algorithm. Streaming batch principal component analysis uses stochastic power iterations to generate a stochastic k-rank approximation of the network gradient. We demonstrate that the low rank updates produced by streaming batch principal component analysis can effectively train convolutional neural networks on a variety of common datasets, with performance comparable to standard mini batch gradient descent. These results can lead to both improvements in the design of application specific integrated circuits for deep learning and in the speed of synchronization of machine learning models trained with data parallelism.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes