MLLGOct 14, 2024

Bayesian Optimisation with Unknown Hyperparameters: Regret Bounds Logarithmically Closer to Optimal

Oxford
arXiv:2410.10384v36 citationsh-index: 8NIPS
Originality Highly original
AI Analysis

This work addresses a key bottleneck in BO for optimizing black-box functions, offering improved theoretical guarantees and empirical performance, though it is incremental over prior solutions like A-GP-UCB.

The paper tackles the problem of Bayesian Optimization (BO) requiring a pre-specified length scale hyperparameter, which can lead to misspecification and over-exploration, by introducing Length scale Balancing (LB) to aggregate multiple surrogate models with varying length scales, achieving a cumulative regret bound only logarithmically away from an oracle's optimal regret and outperforming existing methods in benchmarks.

Bayesian Optimization (BO) is widely used for optimising black-box functions but requires us to specify the length scale hyperparameter, which defines the smoothness of the functions the optimizer will consider. Most current BO algorithms choose this hyperparameter by maximizing the marginal likelihood of the observed data, albeit risking misspecification if the objective function is less smooth in regions we have not yet explored. The only prior solution addressing this problem with theoretical guarantees was A-GP-UCB, proposed by Berkenkamp et al. (2019). This algorithm progressively decreases the length scale, expanding the class of functions considered by the optimizer. However, A-GP-UCB lacks a stopping mechanism, leading to over-exploration and slow convergence. To overcome this, we introduce Length scale Balancing (LB) - a novel approach, aggregating multiple base surrogate models with varying length scales. LB intermittently adds smaller length scale candidate values while retaining longer scales, balancing exploration and exploitation. We formally derive a cumulative regret bound of LB and compare it with the regret of an oracle BO algorithm using the optimal length scale. Denoting the factor by which the regret bound of A-GP-UCB was away from oracle as $g(T)$, we show that LB is only $\log g(T)$ away from oracle regret. We also empirically evaluate our algorithm on synthetic and real-world benchmarks and show it outperforms A-GP-UCB, maximum likelihood estimation and MCMC.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes