CVLGNEJun 23, 2020

NeuralScale: Efficient Scaling of Neurons for Resource-Constrained Deep Neural Networks

arXiv:2006.12813v115 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of designing efficient neural networks for resource-constrained applications, though it is incremental as it builds on existing pruning and scaling methods.

The paper tackles the problem of optimizing neuron configurations in deep neural networks to maximize accuracy under parameter constraints, achieving accuracy increases of 3.04% to 8.56% on benchmark datasets like CIFAR10 and CIFAR100.

Deciding the amount of neurons during the design of a deep neural network to maximize performance is not intuitive. In this work, we attempt to search for the neuron (filter) configuration of a fixed network architecture that maximizes accuracy. Using iterative pruning methods as a proxy, we parameterize the change of the neuron (filter) number of each layer with respect to the change in parameters, allowing us to efficiently scale an architecture across arbitrary sizes. We also introduce architecture descent which iteratively refines the parameterized function used for model scaling. The combination of both proposed methods is coined as NeuralScale. To prove the efficiency of NeuralScale in terms of parameters, we show empirical simulations on VGG11, MobileNetV2 and ResNet18 using CIFAR10, CIFAR100 and TinyImageNet as benchmark datasets. Our results show an increase in accuracy of 3.04%, 8.56% and 3.41% for VGG11, MobileNetV2 and ResNet18 on CIFAR10, CIFAR100 and TinyImageNet respectively under a parameter-constrained setting (output neurons (filters) of default configuration with scaling factor of 0.25).

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes