MLOct 19, 2015

Optimization for Gaussian Processes via Chaining

arXiv:1510.05576v18 citations
Originality Incremental advance
AI Analysis

This work addresses optimization in machine learning for scenarios with limited feedback, but it is incremental as it builds on an existing algorithm with broader applicability.

The paper tackles the problem of stochastic optimization under bandit feedback by generalizing the GP-UCB algorithm to arbitrary kernels and search spaces using localized chaining, achieving theoretical regret bounds with the same convergence rates as GP-UCB and showing empirical efficiency over competitors.

In this paper, we consider the problem of stochastic optimization under a bandit feedback model. We generalize the GP-UCB algorithm [Srinivas and al., 2012] to arbitrary kernels and search spaces. To do so, we use a notion of localized chaining to control the supremum of a Gaussian process, and provide a novel optimization scheme based on the computation of covering numbers. The theoretical bounds we obtain on the cumulative regret are more generic and present the same convergence rates as the GP-UCB algorithm. Finally, the algorithm is shown to be empirically more efficient than its natural competitors on simple and complex input spaces.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes