LGDCMLOct 25, 2019

An End-to-End HW/SW Co-Design Methodology to Design Efficient Deep Neural Network Systems using Virtual Models

arXiv:1910.11632v22 citations
Originality Synthesis-oriented
AI Analysis

This work addresses efficiency improvements for designers of DNN systems, though it is incremental as it builds on existing virtual modeling and compiler techniques.

The paper tackles the challenge of reducing turn-around time for evaluating hardware/software design choices in deep neural network systems by proposing a methodology that uses virtual hardware models for performance estimation during the concept phase, achieving up to 92% accuracy in predicting DNN inference processing time.

End-to-end performance estimation and measurement of deep neural network (DNN) systems become more important with increasing complexity of DNN systems consisting of hardware and software components. The methodology proposed in this paper aims at a reduced turn-around time for evaluating different design choices of hardware and software components of DNN systems. This reduction is achieved by moving the performance estimation from the implementation phase to the concept phase by employing virtual hardware models instead of gathering measurement results from physical prototypes. Deep learning compilers introduce hardware-specific transformations and are, therefore, considered a part of the design flow of virtual system models to extract end-to-end performance estimations. To validate the run-time accuracy of the proposed methodology, a system processing the DilatedVGG DNN is realized both as virtual system model and as hardware implementation. The results show that up to 92 % accuracy can be reached in predicting the processing time of the DNN inference.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes