CVApr 15, 2018

SparseNet: A Sparse DenseNet for Image Classification

arXiv:1804.05340v136 citations
Originality Incremental advance
AI Analysis

This work addresses efficiency issues in deep learning for computer vision researchers and practitioners, offering an incremental improvement over existing DenseNet architectures.

The paper tackles the problem of improving parameter and computational efficiency in deep neural networks for image classification by sparsifying DenseNet, reducing connections from O(L^2) to O(L) and adding an attention module. The result is SparseNet, which achieves state-of-the-art improvements on CIFAR10 and SVHN, is 2.6 times smaller and 3.7 times faster than DenseNet while maintaining comparable performance.

Deep neural networks have made remarkable progresses on various computer vision tasks. Recent works have shown that depth, width and shortcut connections of networks are all vital to their performances. In this paper, we introduce a method to sparsify DenseNet which can reduce connections of a L-layer DenseNet from O(L^2) to O(L), and thus we can simultaneously increase depth, width and connections of neural networks in a more parameter-efficient and computation-efficient way. Moreover, an attention module is introduced to further boost our network's performance. We denote our network as SparseNet. We evaluate SparseNet on datasets of CIFAR(including CIFAR10 and CIFAR100) and SVHN. Experiments show that SparseNet can obtain improvements over the state-of-the-art on CIFAR10 and SVHN. Furthermore, while achieving comparable performances as DenseNet on these datasets, SparseNet is x2.6 smaller and x3.7 faster than the original DenseNet.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes