CVApr 9, 2019

FPGA/DNN Co-Design: An Efficient Design Methodology for IoT Intelligence on the Edge

arXiv:1904.04421v1187 citations
Originality Incremental advance
AI Analysis

This addresses the problem of efficient DNN acceleration for IoT edge devices, offering a novel co-design approach that is incremental in combining existing techniques.

The paper tackles the challenge of deploying DNNs on resource-constrained edge FPGAs by proposing a simultaneous FPGA/DNN co-design methodology, resulting in a design that outperforms state-of-the-art FPGA solutions with 6.2% higher IoU, 2.48X higher FPS, 40% lower power consumption, and 2.5X higher energy efficiency.

While embedded FPGAs are attractive platforms for DNN acceleration on edge-devices due to their low latency and high energy efficiency, the scarcity of resources of edge-scale FPGA devices also makes it challenging for DNN deployment. In this paper, we propose a simultaneous FPGA/DNN co-design methodology with both bottom-up and top-down approaches: a bottom-up hardware-oriented DNN model search for high accuracy, and a top-down FPGA accelerator design considering DNN-specific characteristics. We also build an automatic co-design flow, including an Auto-DNN engine to perform hardware-oriented DNN model search, as well as an Auto-HLS engine to generate synthesizable C code of the FPGA accelerator for explored DNNs. We demonstrate our co-design approach on an object detection task using PYNQ-Z1 FPGA. Results show that our proposed DNN model and accelerator outperform the state-of-the-art FPGA designs in all aspects including Intersection-over-Union (IoU) (6.2% higher), frames per second (FPS) (2.48X higher), power consumption (40% lower), and energy efficiency (2.5X higher). Compared to GPU-based solutions, our designs deliver similar accuracy but consume far less energy.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes