MLJul 13, 2017

Distributionally Ambiguous Optimization Techniques for Batch Bayesian Optimization

arXiv:1707.04191v410 citations
Originality Incremental advance
AI Analysis

This work addresses the computational bottleneck in batch Bayesian optimization for researchers and practitioners, offering a more efficient method, though it is incremental as it builds on existing Expected Improvement functions.

The authors tackled the computational challenge of batch Bayesian optimization by proposing a new acquisition function based on distributionally ambiguous optimization, which avoids multi-dimensional integrations and can be computed exactly as a convex optimization problem, leading to superior performance in benchmark and real-world problems.

We propose a novel, theoretically-grounded, acquisition function for Batch Bayesian optimization informed by insights from distributionally ambiguous optimization. Our acquisition function is a lower bound on the well-known Expected Improvement function, which requires evaluation of a Gaussian Expectation over a multivariate piecewise affine function. Our bound is computed instead by evaluating the best-case expectation over all probability distributions consistent with the same mean and variance as the original Gaussian distribution. Unlike alternative approaches, including Expected Improvement, our proposed acquisition function avoids multi-dimensional integrations entirely, and can be computed exactly - even on large batch sizes - as the solution of a tractable convex optimization problem. Our suggested acquisition function can also be optimized efficiently, since first and second derivative information can be calculated inexpensively as by-products of the acquisition function calculation itself. We derive various novel theorems that ground our work theoretically and we demonstrate superior performance via simple motivating examples, benchmark functions and real-world problems.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes