LGCYSEJul 7, 2023

Estimating Deep Learning energy consumption based on model architecture and training environment

arXiv:2307.05520v57 citationsh-index: 45
Originality Incremental advance
AI Analysis

It addresses the environmental impact of deep learning for researchers and practitioners by providing more accurate estimation methods, though it is incremental as it builds on prior energy awareness studies.

This work tackles the problem of inaccurate energy consumption estimates in deep learning training by analyzing how model architecture and training environment affect energy use, finding that optimal combinations can reduce training energy by up to 80.68% with minimal accuracy loss.

To raise awareness of the environmental impact of deep learning (DL), many studies estimate the energy use of DL systems. However, energy estimates during DL training often rely on unverified assumptions. This work addresses that gap by investigating how model architecture and training environment affect energy consumption. We train a variety of computer vision models and collect energy consumption and accuracy metrics to analyze their trade-offs across configurations. Our results show that selecting the right model-training environment combination can reduce training energy consumption by up to 80.68% with less than 2% loss in $F_1$ score. We find a significant interaction effect between model and training environment: energy efficiency improves when GPU computational power scales with model complexity. Moreover, we demonstrate that common estimation practices, such as using FLOPs or GPU TDP, fail to capture these dynamics and can lead to substantial errors. To address these shortcomings, we propose the Stable Training Epoch Projection (STEP) and the Pre-training Regression-based Estimation (PRE) methods. Across evaluations, our methods outperform existing tools by a factor of two or more in estimation accuracy.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes