LGMLDec 11, 2019

Is Feature Diversity Necessary in Neural Network Initialization?

arXiv:1912.05137v33 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of understanding initialization requirements for neural network training, providing insights that could simplify or optimize initialization schemes, though it appears incremental in nature.

The study investigated whether feature diversity is necessary for neural network initialization, finding that while a complete lack of diversity harms training, adding small noise (even from GPU computations) can counteract this, and a network with identical features and mostly zero weights can be trained to match standard initialization accuracy.

Standard practice in training neural networks involves initializing the weights in an independent fashion. The results of recent work suggest that feature "diversity" at initialization plays an important role in training the network. However, other initialization schemes with reduced feature diversity have also been shown to be viable. In this work, we conduct a series of experiments aimed at elucidating the importance of feature diversity at initialization. We show that a complete lack of diversity is harmful to training, but its effects can be counteracted by a relatively small addition of noise - even the noise in standard non-deterministic GPU computations is sufficient. Furthermore, we construct a deep convolutional network with identical features at initialization and almost all of the weights initialized at 0 that can be trained to reach accuracy matching its standard-initialized counterpart.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes