LGNov 13, 2016

Batched Gaussian Process Bandit Optimization via Determinantal Point Processes

arXiv:1611.04088v1107 citations
Originality Highly original
AI Analysis

This addresses the need for efficient parallel processing in hyper-parameter optimization and robotics, offering an incremental improvement over existing batch methods.

The paper tackles the problem of expensive batch evaluations in Bayesian optimization by proposing a method that uses Determinantal Point Processes (DPPs) to model batch diversity, with experiments showing it outperforms state-of-the-art methods on synthetic and real-world tasks.

Gaussian Process bandit optimization has emerged as a powerful tool for optimizing noisy black box functions. One example in machine learning is hyper-parameter optimization where each evaluation of the target function requires training a model which may involve days or even weeks of computation. Most methods for this so-called "Bayesian optimization" only allow sequential exploration of the parameter space. However, it is often desirable to propose batches or sets of parameter values to explore simultaneously, especially when there are large parallel processing facilities at our disposal. Batch methods require modeling the interaction between the different evaluations in the batch, which can be expensive in complex scenarios. In this paper, we propose a new approach for parallelizing Bayesian optimization by modeling the diversity of a batch via Determinantal point processes (DPPs) whose kernels are learned automatically. This allows us to generalize a previous result as well as prove better regret bounds based on DPP sampling. Our experiments on a variety of synthetic and real-world robotics and hyper-parameter optimization tasks indicate that our DPP-based methods, especially those based on DPP sampling, outperform state-of-the-art methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes