CVAINov 30, 2020

Training and Inference for Integer-Based Semantic Segmentation Network

arXiv:2011.14504v1
AI Analysis

This work provides a method to reduce the computational resources required for semantic segmentation, enabling faster training and inference on fixed-point devices for practitioners in resource-constrained environments.

This paper addresses the computational demands of semantic segmentation networks by proposing a novel 8-bit integer-based quantization framework for both training and inference. The framework achieves comparable accuracy to floating-point implementations on the ADE20K and PASCAL VOC 2012 datasets using FCN-VGG16 and DeepLabv3-ResNet50.

Semantic segmentation has been a major topic in research and industry in recent years. However, due to the computation complexity of pixel-wise prediction and backpropagation algorithm, semantic segmentation has been demanding in computation resources, resulting in slow training and inference speed and large storage space to store models. Existing schemes that speed up segmentation network change the network structure and come with noticeable accuracy degradation. However, neural network quantization can be used to reduce computation load while maintaining comparable accuracy and original network structure. Semantic segmentation networks are different from traditional deep convolutional neural networks (DCNNs) in many ways, and this topic has not been thoroughly explored in existing works. In this paper, we propose a new quantization framework for training and inference of segmentation networks, where parameters and operations are constrained to 8-bit integer-based values for the first time. Full quantization of the data flow and the removal of square and root operations in batch normalization give our framework the ability to perform inference on fixed-point devices. Our proposed framework is evaluated on mainstream semantic segmentation networks like FCN-VGG16 and DeepLabv3-ResNet50, achieving comparable accuracy against floating-point framework on ADE20K dataset and PASCAL VOC 2012 dataset.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes