CV · LG · Aug 6, 2019

Full-Stack Filters to Build Minimum Viable CNNs

arXiv:1908.02023v1 · 6 citations
AI Analysis

This paper addresses the challenge of deploying efficient CNNs on edge devices such as mobile phones. It is an incremental improvement over existing filter-reduction methods.

The paper tackles the problem of over-parameterized CNNs for edge deployment by introducing full-stack filters that generate diverse sub-filters using binary masks, enabling the construction of minimum viable CNNs with comparable performance on benchmark datasets.

Deep convolutional neural networks (CNNs) are usually over-parameterized and therefore cannot be easily deployed on edge devices such as mobile phones and smart cameras. Existing works decrease the number or size of the required convolution filters to obtain a minimum viable CNN for edge devices. In contrast, this paper introduces full-stack filters, each of which can be used to generate many more sub-filters. The weights of these sub-filters are inherited from the full-stack filters through different binary masks. Orthogonal constraints are applied to the binary masks to decrease their correlation and promote the diversity of the generated sub-filters. Because the sub-filters preserve the volume of output feature maps, the number of stored filters can be reduced by maintaining only a few full-stack filters and a set of binary masks. We also analyze the memory cost theoretically and introduce an efficient implementation of convolution with the proposed filters. Experiments on several benchmark datasets and CNN models demonstrate that the proposed method is able to construct minimum viable convolution networks of comparable performance.
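
The mechanism described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the {0, 1} mask encoding, the Gram-matrix form of the correlation penalty, and the 32-bit-float versus 1-bit-mask storage model are all assumptions made for illustration. It shows one full-stack filter producing several sub-filters through elementwise binary masks, a penalty that is zero when the masks do not overlap, and a rough storage comparison.

```python
import numpy as np


def make_sub_filters(full_filter, masks):
    """Derive sub-filters from one full-stack filter (hypothetical helper).

    full_filter: float array of shape (c, k, k)
    masks:       array of shape (s, c, k, k) with entries in {0, 1} (assumed encoding)
    Returns an (s, c, k, k) array; each sub-filter inherits the weights its mask keeps.
    """
    return masks.astype(full_filter.dtype) * full_filter[None, ...]


def mask_correlation_penalty(masks):
    """Sum of squared off-diagonal entries of the normalized mask Gram matrix.

    An assumed stand-in for the paper's orthogonal constraint: the value is 0
    when no two masks share a selected position.
    """
    s = masks.shape[0]
    flat = masks.reshape(s, -1).astype(np.float64)
    gram = flat @ flat.T / flat.shape[1]
    off_diag = gram - np.diag(np.diag(gram))
    return float(np.sum(off_diag ** 2))


def storage_bits(num_sub_filters, weights_per_filter, float_bits=32):
    """Compare storage under an assumed model: float weights, 1 bit per mask entry."""
    dense = num_sub_filters * weights_per_filter * float_bits
    shared = weights_per_filter * float_bits + num_sub_filters * weights_per_filter
    return dense, shared


rng = np.random.default_rng(0)
full_filter = rng.standard_normal((3, 3, 3))    # one full-stack filter, c=3, k=3
masks = rng.integers(0, 2, size=(4, 3, 3, 3))   # four random binary masks
sub_filters = make_sub_filters(full_filter, masks)

dense_bits, shared_bits = storage_bits(num_sub_filters=4, weights_per_filter=27)
```

Under this storage model, four independent 3x3x3 filters take 4 * 27 * 32 = 3456 bits, while one full-stack filter plus four 1-bit masks takes 27 * 32 + 4 * 27 = 972 bits. The actual savings reported in the paper depend on its own analysis and implementation.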

Code Implementations: 1 repo