LGMLMay 28, 2019

CompactNet: Platform-Aware Automatic Optimization for Convolutional Neural Networks

arXiv:1905.11669v15 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses the problem of deploying efficient CNNs on mobile and embedded devices, offering an incremental improvement over existing methods.

The authors tackled the challenge of implementing CNN models on resource-limited platforms by proposing CompactNet, which automatically optimizes pre-trained models for specific platforms, achieving up to a 1.8x speedup in kernel computation while maintaining or improving accuracy on Cifar-10.

Convolutional Neural Network (CNN) based Deep Learning (DL) has achieved great progress in many real-life applications. Meanwhile, due to the complex model structures against strict latency and memory restriction, the implementation of CNN models on the resource-limited platforms is becoming more challenging. This work proposes a solution, called CompactNet\footnote{Project URL: \url{https://github.com/CompactNet/CompactNet}}, which automatically optimizes a pre-trained CNN model on a specific resource-limited platform given a specific target of inference speedup. Guided by a simulator of the target platform, CompactNet progressively trims a pre-trained network by removing certain redundant filters until the target speedup is reached and generates an optimal platform-specific model while maintaining the accuracy. We evaluate our work on two platforms of a mobile ARM CPU and a machine learning accelerator NPU (Cambricon-1A ISA) on a Huawei Mate10 smartphone. For the state-of-the-art slim CNN model made for the embedded platform, MobileNetV2, CompactNet achieves up to a 1.8x kernel computation speedup with equal or even higher accuracy for image classification tasks on the Cifar-10 dataset.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes