LGCVMLJul 20, 2020

ThriftyNets : Convolutional Neural Networks with Tiny Parameter Budget

arXiv:2007.10106v1
AI Analysis

This addresses the problem of high computational and memory costs in deep learning for resource-constrained applications, though it is an incremental improvement in network design.

The paper tackles the inefficiency of parameter distribution in deep convolutional neural networks by proposing ThriftyNet, a recursive architecture that maximizes parameter factorization, achieving over 91% accuracy on CIFAR-10 with under 40K parameters and 74.3% on CIFAR-100 with under 600K parameters.

Typical deep convolutional architectures present an increasing number of feature maps as we go deeper in the network, whereas spatial resolution of inputs is decreased through downsampling operations. This means that most of the parameters lay in the final layers, while a large portion of the computations are performed by a small fraction of the total parameters in the first layers. In an effort to use every parameter of a network at its maximum, we propose a new convolutional neural network architecture, called ThriftyNet. In ThriftyNet, only one convolutional layer is defined and used recursively, leading to a maximal parameter factorization. In complement, normalization, non-linearities, downsamplings and shortcut ensure sufficient expressivity of the model. ThriftyNet achieves competitive performance on a tiny parameters budget, exceeding 91% accuracy on CIFAR-10 with less than 40K parameters in total, and 74.3% on CIFAR-100 with less than 600K parameters.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes