DCLGPFMLJun 5, 2020

Daydream: Accurately Estimating the Efficacy of Optimizations for DNN Training

arXiv:2006.03318v167 citations
Originality Incremental advance
AI Analysis

This addresses a critical limitation for ML practitioners and system developers by enabling efficient exploration of optimization impacts without costly implementation, though it is incremental as it builds on existing profiling and modeling techniques.

The paper tackles the problem of predicting the performance impact of software-level optimizations in deep neural network (DNN) training across diverse configurations, and introduces Daydream, a profiling tool that accurately models and predicts optimization efficacy, achieving significant performance improvements.

Modern deep neural network (DNN) training jobs use complex and heterogeneous software/hardware stacks. The efficacy of software-level optimizations can vary significantly when used in different deployment configurations. It is onerous and error-prone for ML practitioners and system developers to implement each optimization separately, and determine which ones will improve performance in their own configurations. Unfortunately, existing profiling tools do not aim to answer predictive questions such as "How will optimization X affect the performance of my model?". We address this critical limitation, and proposes a new profiling tool, Daydream, to help programmers efficiently explore the efficacy of DNN optimizations. Daydream models DNN execution with a fine-grained dependency graph based on low-level traces collected by CUPTI, and predicts runtime by simulating execution based on the dependency graph. Daydream maps the low-level traces using DNN domain-specific knowledge, and introduces a set of graph-transformation primitives that can easily model a wide variety of optimizations. We show that Daydream is able to model most mainstream DNN optimization techniques, and accurately predict the efficacy of optimizations that will result in significant performance improvements.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes