CVMar 25, 2017

More is Less: A More Complicated Network with Less Inference Complexity

Xuanyi Dong, Junshi Huang, Yi Yang, Shuicheng Yan

arXiv:1703.08651v224.6299 citations

Originality Incremental advance

AI Analysis

This addresses the need for faster inference in deep learning models, particularly for resource-constrained applications, though it is incremental as it builds on existing convolutional layers.

The paper tackles the problem of accelerating inference in convolutional neural networks by introducing a more complicated network structure that reduces inference complexity, achieving an average speedup of 32% with negligible performance drop on benchmarks like CIFAR-10, CIFAR-100, and ILSVRC-2012.

In this paper, we present a novel and general network structure towards accelerating the inference process of convolutional neural networks, which is more complicated in network structure yet with less inference complexity. The core idea is to equip each original convolutional layer with another low-cost collaborative layer (LCCL), and the element-wise multiplication of the ReLU outputs of these two parallel layers produces the layer-wise output. The combined layer is potentially more discriminative than the original convolutional layer, and its inference is faster for two reasons: 1) the zero cells of the LCCL feature maps will remain zero after element-wise multiplication, and thus it is safe to skip the calculation of the corresponding high-cost convolution in the original convolutional layer, 2) LCCL is very fast if it is implemented as a 1*1 convolution or only a single filter shared by all channels. Extensive experiments on the CIFAR-10, CIFAR-100 and ILSCRC-2012 benchmarks show that our proposed network structure can accelerate the inference process by 32\% on average with negligible performance drop.

View on arXiv PDF

Similar