LG MLDec 11, 2019

Is Feature Diversity Necessary in Neural Network Initialization?

Yaniv Blumenfeld, Dar Gilboa, Daniel Soudry

arXiv:1912.05137v33.43 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the problem of understanding initialization requirements for neural network training, providing insights that could simplify or optimize initialization schemes, though it appears incremental in nature.

The study investigated whether feature diversity is necessary for neural network initialization, finding that while a complete lack of diversity harms training, adding small noise (even from GPU computations) can counteract this, and a network with identical features and mostly zero weights can be trained to match standard initialization accuracy.

Standard practice in training neural networks involves initializing the weights in an independent fashion. The results of recent work suggest that feature "diversity" at initialization plays an important role in training the network. However, other initialization schemes with reduced feature diversity have also been shown to be viable. In this work, we conduct a series of experiments aimed at elucidating the importance of feature diversity at initialization. We show that a complete lack of diversity is harmful to training, but its effects can be counteracted by a relatively small addition of noise - even the noise in standard non-deterministic GPU computations is sufficient. Furthermore, we construct a deep convolutional network with identical features at initialization and almost all of the weights initialized at 0 that can be trained to reach accuracy matching its standard-initialized counterpart.

View on arXiv PDF Code

Similar