LGPFDec 3, 2020

ResPerfNet: Deep Residual Learning for Regressional Performance Modeling of Deep Neural Networks

arXiv:2012.01671v1
AI Analysis

This work provides a method for deep learning practitioners to more accurately predict the performance of deep neural networks on specific computing platforms, which can aid in design space exploration.

This paper addresses the challenge of predicting the performance of deep neural networks on various computing infrastructures. The authors propose ResPerfNet, a residual neural network that predicts execution time for individual layers and full network models, achieving a mean absolute percentage error of 8.4% for LeNet, AlexNet, and VGG16 on an NVIDIA GTX 1080Ti.

The rapid advancements of computing technology facilitate the development of diverse deep learning applications. Unfortunately, the efficiency of parallel computing infrastructures varies widely with neural network models, which hinders the exploration of the design space to find high-performance neural network architectures on specific computing platforms for a given application. To address such a challenge, we propose a deep learning-based method, ResPerfNet, which trains a residual neural network with representative datasets obtained on the target platform to predict the performance for a deep neural network. Our experimental results show that ResPerfNet can accurately predict the execution time of individual neural network layers and full network models on a variety of platforms. In particular, ResPerfNet achieves 8.4% of mean absolute percentage error for LeNet, AlexNet and VGG16 on the NVIDIA GTX 1080Ti, which is substantially lower than the previously published works.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes