MLLGJun 16, 2014

Freeze-Thaw Bayesian Optimization

arXiv:1406.3896v1289 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of hyperparameter optimization for machine learning practitioners, though it appears incremental as it builds on existing Bayesian optimization techniques.

The paper tackles the problem of efficiently finding good hyperparameter settings for machine learning models by developing a dynamic Bayesian optimization method that uses partial training information to decide whether to pause or resume model training, resulting in an approach shown to be extremely effective in experiments.

In this paper we develop a dynamic form of Bayesian optimization for machine learning models with the goal of rapidly finding good hyperparameter settings. Our method uses the partial information gained during the training of a machine learning model in order to decide whether to pause training and start a new model, or resume the training of a previously-considered model. We specifically tailor our method to machine learning problems by developing a novel positive-definite covariance kernel to capture a variety of training curves. Furthermore, we develop a Gaussian process prior that scales gracefully with additional temporal observations. Finally, we provide an information-theoretic framework to automate the decision process. Experiments on several common machine learning models show that our approach is extremely effective in practice.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes