LGNEDGDec 12, 2020

Sparsifying networks by traversing Geodesics

arXiv:2012.09605v13 citations
AI Analysis

This work offers an incremental geometric approach to network sparsification and other problems for machine learning researchers.

This paper proposes a mathematical framework to evaluate geodesics in the functional space of neural networks. The authors demonstrate its application in finding high-performance paths from dense to sparser networks, testing on VGG-11 with CIFAR-10 and MLPs with MNIST.

The geometry of weight spaces and functional manifolds of neural networks play an important role towards 'understanding' the intricacies of ML. In this paper, we attempt to solve certain open questions in ML, by viewing them through the lens of geometry, ultimately relating it to the discovery of points or paths of equivalent function in these spaces. We propose a mathematical framework to evaluate geodesics in the functional space, to find high-performance paths from a dense network to its sparser counterpart. Our results are obtained on VGG-11 trained on CIFAR-10 and MLP's trained on MNIST. Broadly, we demonstrate that the framework is general, and can be applied to a wide variety of problems, ranging from sparsification to alleviating catastrophic forgetting.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes