LGNEMLJun 15, 2020

Finding trainable sparse networks through Neural Tangent Transfer

arXiv:2006.08228v240 citations
AI Analysis

This addresses the need for more efficient neural networks in machine learning, offering a novel approach to sparsity without relying on labels, though it is incremental in improving existing pruning methods.

The paper tackles the problem of high memory and energy demands in deep neural networks by introducing Neural Tangent Transfer, a label-free method to find trainable sparse networks that mimic dense network training dynamics, resulting in higher classification performance and faster convergence on standard tasks.

Deep neural networks have dramatically transformed machine learning, but their memory and energy demands are substantial. The requirements of real biological neural networks are rather modest in comparison, and one feature that might underlie this austerity is their sparse connectivity. In deep learning, trainable sparse networks that perform well on a specific task are usually constructed using label-dependent pruning criteria. In this article, we introduce Neural Tangent Transfer, a method that instead finds trainable sparse networks in a label-free manner. Specifically, we find sparse networks whose training dynamics, as characterized by the neural tangent kernel, mimic those of dense networks in function space. Finally, we evaluate our label-agnostic approach on several standard classification tasks and show that the resulting sparse networks achieve higher classification performance while converging faster.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes