CVJan 7, 2019

DSConv: Efficient Convolution Operator

arXiv:1901.01928v2109 citations
AI Analysis

This enables efficient deployment of CNNs in resource-constrained environments without retraining, though it is incremental over existing quantization methods.

The paper tackles the problem of quantizing pre-trained convolutional neural networks without labeled data, achieving state-of-the-art results with less than 1% accuracy loss using only 4-bit quantization on models like ResNet and VGG-Net.

Quantization is a popular way of increasing the speed and lowering the memory usage of Convolution Neural Networks (CNNs). When labelled training data is available, network weights and activations have successfully been quantized down to 1-bit. The same cannot be said about the scenario when labelled training data is not available, e.g. when quantizing a pre-trained model, where current approaches show, at best, no loss of accuracy at 8-bit quantizations. We introduce DSConv, a flexible quantized convolution operator that replaces single-precision operations with their far less expensive integer counterparts, while maintaining the probability distributions over both the kernel weights and the outputs. We test our model as a plug-and-play replacement for standard convolution on most popular neural network architectures, ResNet, DenseNet, GoogLeNet, AlexNet and VGG-Net and demonstrate state-of-the-art results, with less than 1% loss of accuracy, without retraining, using only 4-bit quantization. We also show how a distillation-based adaptation stage with unlabelled data can improve results even further.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes