LGMLFeb 3, 2023

A Lipschitz Bandits Approach for Continuous Hyperparameter Optimization

arXiv:2302.01539v310 citationsh-index: 7
Originality Highly original
AI Analysis

This addresses the problem of efficient and theoretically grounded HPO for machine learning practitioners, offering a novel method with strong empirical gains.

The paper tackles hyperparameter optimization (HPO) by introducing BLiE, a Lipschitz-bandit-based algorithm that assumes only Lipschitz continuity, and shows it outperforms state-of-the-art methods on benchmarks and improves diffusion model sampling speed.

One of the most critical problems in machine learning is HyperParameter Optimization (HPO), since choice of hyperparameters has a significant impact on final model performance. Although there are many HPO algorithms, they either have no theoretical guarantees or require strong assumptions. To this end, we introduce BLiE -- a Lipschitz-bandit-based algorithm for HPO that only assumes Lipschitz continuity of the objective function. BLiE exploits the landscape of the objective function to adaptively search over the hyperparameter space. Theoretically, we show that $(i)$ BLiE finds an $ε$-optimal hyperparameter with $\mathcal{O} \left( ε^{-(d_z + β)}\right)$ total budgets, where $d_z$ and $β$ are problem intrinsic; $(ii)$ BLiE is highly parallelizable. Empirically, we demonstrate that BLiE outperforms the state-of-the-art HPO algorithms on benchmark tasks. We also apply BLiE to search for noise schedule of diffusion models. Comparison with the default schedule shows that BLiE schedule greatly improves the sampling speed.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes