CV, Feb 11, 2018

Edge-Host Partitioning of Deep Neural Networks with Feature Space Encoding for Resource-Constrained Internet-of-Things Platforms

arXiv:1802.03835v1, 160 citations
Originality: Incremental advance
AI Analysis

This paper addresses the challenge of efficient DNN inference for IoT applications. It appears incremental, since it builds on existing partitioning and encoding ideas.

This paper tackles the problem of deploying deep neural network inference on resource-constrained IoT platforms by partitioning the task between edge and host, using feature space encoding to reduce energy and increase throughput. Simulation results show significant improvements in energy-efficiency and throughput over baseline configurations.

This paper introduces the partitioning of a deep neural network inference task between an edge and a host platform in the IoT environment. We present a DNN as an encoding pipeline, and propose to transmit the output feature space of an intermediate layer to the host. Lossless or lossy encoding of the feature space is proposed to enhance the maximum input rate supported by the edge platform and/or reduce the energy of the edge platform. Simulation results show that partitioning a DNN at the end of the convolutional (feature extraction) layers, coupled with feature space encoding, enables significant improvement in energy efficiency and throughput over the baseline configurations that perform the entire inference at the edge or at the host.
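The edge-side encoding step described in the abstract can be illustrated with a minimal sketch. It is not the paper's codec: the 8-bit uniform quantization (the lossy step), the use of `zlib` (the lossless step), the function names, and the toy feature vector are all assumptions chosen for illustration. The sketch assumes the intermediate features are non-negative, as they would be after a ReLU.

```python
import zlib

def quantize(features, levels=256):
    """Lossy step (assumed): map non-negative floats to integers in [0, levels-1]."""
    peak = max(features) or 1.0          # avoid division by zero for an all-zero map
    scale = (levels - 1) / peak
    return bytes(round(v * scale) for v in features), scale

def encode(features):
    """Edge side: quantize, then compress losslessly before transmission."""
    quantized, scale = quantize(features)
    return zlib.compress(quantized), scale

def decode(payload, scale):
    """Host side: decompress and map the integers back to approximate floats."""
    return [b / scale for b in zlib.decompress(payload)]

# Hypothetical intermediate feature vector: mostly zeros, as after a ReLU.
features = [0.0] * 900 + [0.5, 1.25, 2.0, 0.75] * 25

payload, scale = encode(features)        # what the edge would transmit
restored = decode(payload, scale)        # what the host would feed to later layers
raw_bytes = 4 * len(features)            # size if sent as 32-bit floats

print(f"raw: {raw_bytes} bytes, encoded: {len(payload)} bytes")
```

In this toy setting the sparse, quantized features compress to far fewer bytes than the raw 32-bit values, which is the kind of saving in transmitted data that the partitioning scheme relies on. The actual gain depends on the network, the layer chosen as the partition point, and the encoder used.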

