DCARLGFeb 24, 2023

Map-and-Conquer: Energy-Efficient Mapping of Dynamic Neural Nets onto Heterogeneous MPSoCs

arXiv:2302.12926v116 citationsh-index: 40
Originality Incremental advance
AI Analysis

This addresses energy and latency inefficiencies in deploying neural networks on heterogeneous hardware for embedded and edge computing applications, representing an incremental improvement over existing mapping strategies.

The paper tackles the problem of mapping neural networks onto heterogeneous MPSoCs to improve energy efficiency and latency, achieving a 2.1x increase in energy efficiency and 1.7x reduction in latency compared to baseline methods.

Heterogeneous MPSoCs comprise diverse processing units of varying compute capabilities. To date, the mapping strategies of neural networks (NNs) onto such systems are yet to exploit the full potential of processing parallelism, made possible through both the intrinsic NNs' structure and underlying hardware composition. In this paper, we propose a novel framework to effectively map NNs onto heterogeneous MPSoCs in a manner that enables them to leverage the underlying processing concurrency. Specifically, our approach identifies an optimal partitioning scheme of the NN along its `width' dimension, which facilitates deployment of concurrent NN blocks onto different hardware computing units. Additionally, our approach contributes a novel scheme to deploy partitioned NNs onto the MPSoC as dynamic multi-exit networks for additional performance gains. Our experiments on a standard MPSoC platform have yielded dynamic mapping configurations that are 2.1x more energy-efficient than the GPU-only mapping while incurring 1.7x less latency than DLA-only mapping.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes