ML LG NEJul 1, 2015

Natural Neural Networks

Guillaume Desjardins, Karen Simonyan, Razvan Pascanu, Koray Kavukcuoglu

arXiv:1507.00210v132.9190 citations

Originality Incremental advance

AI Analysis

This method addresses convergence speed issues for machine learning practitioners, offering a scalable solution that is incremental in improving optimization techniques.

The paper tackles the problem of slow convergence in neural network training by introducing Natural Neural Networks, which adapt internal representations to improve conditioning of the Fisher matrix, resulting in faster training on tasks like the ImageNet Challenge dataset.

We introduce Natural Neural Networks, a novel family of algorithms that speed up convergence by adapting their internal representation during training to improve conditioning of the Fisher matrix. In particular, we show a specific example that employs a simple and efficient reparametrization of the neural network weights by implicitly whitening the representation obtained at each layer, while preserving the feed-forward computation of the network. Such networks can be trained efficiently via the proposed Projected Natural Gradient Descent algorithm (PRONG), which amortizes the cost of these reparametrizations over many parameter updates and is closely related to the Mirror Descent online learning algorithm. We highlight the benefits of our method on both unsupervised and supervised learning tasks, and showcase its scalability by training on the large-scale ImageNet Challenge dataset.

View on arXiv PDF

Similar