LGMLDec 22, 2018

Random Projection in Deep Neural Networks

arXiv:1812.09489v16 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of training deep models on sparse, high-dimensional data for machine learning practitioners, though it is incremental as it builds on existing RP methods.

The paper tackles the problem of training deep neural networks on high-dimensional data by using random projection techniques, showing that prepending an RP layer enables efficient training on datasets with tens of millions of features and achieves competitive or improved performance compared to state-of-the-art methods.

This work investigates the ways in which deep learning methods can benefit from random projection (RP), a classic linear dimensionality reduction method. We focus on two areas where, as we have found, employing RP techniques can improve deep models: training neural networks on high-dimensional data and initialization of network parameters. Training deep neural networks (DNNs) on sparse, high-dimensional data with no exploitable structure implies a network architecture with an input layer that has a huge number of weights, which often makes training infeasible. We show that this problem can be solved by prepending the network with an input layer whose weights are initialized with an RP matrix. We propose several modifications to the network architecture and training regime that makes it possible to efficiently train DNNs with learnable RP layer on data with as many as tens of millions of input features and training examples. In comparison to the state-of-the-art methods, neural networks with RP layer achieve competitive performance or improve the results on several extremely high-dimensional real-world datasets. The second area where the application of RP techniques can be beneficial for training deep models is weight initialization. Setting the initial weights in DNNs to elements of various RP matrices enabled us to train residual deep networks to higher levels of performance.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes