MLLGMay 27, 2022

Surrogate modeling for Bayesian optimization beyond a single Gaussian process

arXiv:2205.14090v157 citationsh-index: 141
Originality Incremental advance
AI Analysis

This is an incremental improvement for researchers and practitioners in fields like hyperparameter tuning and drug discovery, as it enhances Bayesian optimization without requiring kernel design.

The paper tackles the limitation of using a single Gaussian process in Bayesian optimization by proposing an ensemble of GPs with Thompson sampling, which improves expressiveness and scalability, achieving competitive performance in synthetic and real-world tests.

Bayesian optimization (BO) has well-documented merits for optimizing black-box functions with an expensive evaluation cost. Such functions emerge in applications as diverse as hyperparameter tuning, drug discovery, and robotics. BO hinges on a Bayesian surrogate model to sequentially select query points so as to balance exploration with exploitation of the search space. Most existing works rely on a single Gaussian process (GP) based surrogate model, where the kernel function form is typically preselected using domain knowledge. To bypass such a design process, this paper leverages an ensemble (E) of GPs to adaptively select the surrogate model fit on-the-fly, yielding a GP mixture posterior with enhanced expressiveness for the sought function. Acquisition of the next evaluation input using this EGP-based function posterior is then enabled by Thompson sampling (TS) that requires no additional design parameters. To endow function sampling with scalability, random feature-based kernel approximation is leveraged per GP model. The novel EGP-TS readily accommodates parallel operation. To further establish convergence of the proposed EGP-TS to the global optimum, analysis is conducted based on the notion of Bayesian regret for both sequential and parallel settings. Tests on synthetic functions and real-world applications showcase the merits of the proposed method.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes