LGAIFeb 20, 2022

Supervising the Multi-Fidelity Race of Hyperparameter Configurations

arXiv:2202.09774v222 citations
AI Analysis

This addresses the efficiency of hyperparameter tuning for deep learning practitioners, representing an incremental improvement over existing multi-fidelity methods.

The paper tackles the problem of sub-optimal budget allocation in multi-fidelity hyperparameter optimization for deep learning by introducing DyHPO, a Bayesian optimization method that dynamically races configurations, resulting in significant superiority over state-of-the-art methods across 50 datasets and diverse architectures.

Multi-fidelity (gray-box) hyperparameter optimization techniques (HPO) have recently emerged as a promising direction for tuning Deep Learning methods. However, existing methods suffer from a sub-optimal allocation of the HPO budget to the hyperparameter configurations. In this work, we introduce DyHPO, a Bayesian Optimization method that learns to decide which hyperparameter configuration to train further in a dynamic race among all feasible configurations. We propose a new deep kernel for Gaussian Processes that embeds the learning curve dynamics, and an acquisition function that incorporates multi-budget information. We demonstrate the significant superiority of DyHPO against state-of-the-art hyperparameter optimization methods through large-scale experiments comprising 50 datasets (Tabular, Image, NLP) and diverse architectures (MLP, CNN/NAS, RNN).

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes