LGDec 20, 2024

PreNeT: Leveraging Computational Features to Predict Deep Neural Network Training Time

arXiv:2412.15519v21 citationsh-index: 27ICPE
Originality Incremental advance
AI Analysis

This addresses the optimization challenge for researchers and practitioners training large models like LLMs, though it appears incremental as an enhancement to existing prediction methodologies.

The paper tackles the problem of predicting deep neural network training time to optimize computational costs, introducing PreNeT, which achieves up to 72% improvement in prediction accuracy over state-of-the-art methods.

Training deep learning models, particularly Transformer-based architectures such as Large Language Models (LLMs), demands substantial computational resources and extended training periods. While optimal configuration and infrastructure selection can significantly reduce associated costs, this optimization requires preliminary analysis tools. This paper introduces PreNeT, a novel predictive framework designed to address this optimization challenge. PreNeT facilitates training optimization by integrating comprehensive computational metrics, including layer-specific parameters, arithmetic operations and memory utilization. A key feature of PreNeT is its capacity to accurately predict training duration on previously unexamined hardware infrastructures, including novel accelerator architectures. This framework employs a sophisticated approach to capture and analyze the distinct characteristics of various neural network layers, thereby enhancing existing prediction methodologies. Through proactive implementation of PreNeT, researchers and practitioners can determine optimal configurations, parameter settings, and hardware specifications to maximize cost-efficiency and minimize training duration. Experimental results demonstrate that PreNeT achieves up to 72% improvement in prediction accuracy compared to contemporary state-of-the-art frameworks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes