CVFeb 17, 2018

Towards Principled Design of Deep Convolutional Networks: Introducing SimpNet

arXiv:1802.06205v145 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses efficiency issues in deep learning for real-world applications by offering a more balanced trade-off between accuracy and resource usage, though it is incremental in improving existing lightweight architectures.

The paper tackles the problem of high computational and memory overheads in deep convolutional networks by proposing SimpNet, a simpler architecture based on design principles and a new SAF-pooling layer, which achieves state-of-the-art results on benchmarks like CIFAR10 and CIFAR100 with 2 to 25 times fewer parameters and operations.

Major winning Convolutional Neural Networks (CNNs), such as VGGNet, ResNet, DenseNet, \etc, include tens to hundreds of millions of parameters, which impose considerable computation and memory overheads. This limits their practical usage in training and optimizing for real-world applications. On the contrary, light-weight architectures, such as SqueezeNet, are being proposed to address this issue. However, they mainly suffer from low accuracy, as they have compromised between the processing power and efficiency. These inefficiencies mostly stem from following an ad-hoc designing procedure. In this work, we discuss and propose several crucial design principles for an efficient architecture design and elaborate intuitions concerning different aspects of the design procedure. Furthermore, we introduce a new layer called {\it SAF-pooling} to improve the generalization power of the network while keeping it simple by choosing best features. Based on such principles, we propose a simple architecture called {\it SimpNet}. We empirically show that SimpNet provides a good trade-off between the computation/memory efficiency and the accuracy solely based on these primitive but crucial principles. SimpNet outperforms the deeper and more complex architectures such as VGGNet, ResNet, WideResidualNet \etc, on several well-known benchmarks, while having 2 to 25 times fewer number of parameters and operations. We obtain state-of-the-art results (in terms of a balance between the accuracy and the number of involved parameters) on standard datasets, such as CIFAR10, CIFAR100, MNIST and SVHN. The implementations are available at \href{url}{https://github.com/Coderx7/SimpNet}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes