LG NE MLAug 6, 2019

Cheetah: Mixed Low-Precision Hardware & Software Co-Design Framework for DNNs on the Edge

Hamed F. Langroudi, Zachariah Carmichael, David Pastuch, Dhireesha Kudithipudi

arXiv:1908.02386v16.024 citations

Originality Incremental advance

AI Analysis

This work addresses the need for energy-efficient DNN deployment on resource-constrained edge devices, representing an incremental advancement by extending posit support to training and mixed-precision scenarios.

The paper tackles the problem of enabling efficient DNN training and inference on edge devices by proposing the Cheetah framework, which supports mixed low-precision formats including posits, and shows that 16-bit posits outperform 16-bit floating point in training, while [5..8]-bit posits improve the performance-energy trade-off over float and fixed-point in inference.

Low-precision DNNs have been extensively explored in order to reduce the size of DNN models for edge devices. Recently, the posit numerical format has shown promise for DNN data representation and compute with ultra-low precision in [5..8]-bits. However, previous studies were limited to studying posit for DNN inference only. In this paper, we propose the Cheetah framework, which supports both DNN training and inference using posits, as well as other commonly used formats. Additionally, the framework is amenable for different quantization approaches and supports mixed-precision floating point and fixed-point numerical formats. Cheetah is evaluated on three datasets: MNIST, Fashion MNIST, and CIFAR-10. Results indicate that 16-bit posits outperform 16-bit floating point in DNN training. Furthermore, performing inference with [5..8]-bit posits improves the trade-off between performance and energy-delay-product over both [5..8]-bit float and fixed-point.

View on arXiv PDF

Similar