MLApr 12, 2017

Preferential Bayesian Optimization

arXiv:1704.03651v1151 citations
Originality Incremental advance
AI Analysis

This addresses a bottleneck in applications like human preference modeling, though it is an incremental extension of Bayesian optimization methods.

The paper tackles the problem of optimizing black-box functions when only pairwise comparisons (preferences) are available, such as in A/B tests or recommender systems, by introducing Preferential Bayesian Optimization (PBO), which reduces the number of comparisons needed to find the optimum.

Bayesian optimization (BO) has emerged during the last few years as an effective approach to optimizing black-box functions where direct queries of the objective are expensive. In this paper we consider the case where direct access to the function is not possible, but information about user preferences is. Such scenarios arise in problems where human preferences are modeled, such as A/B tests or recommender systems. We present a new framework for this scenario that we call Preferential Bayesian Optimization (PBO) which allows us to find the optimum of a latent function that can only be queried through pairwise comparisons, the so-called duels. PBO extends the applicability of standard BO ideas and generalizes previous discrete dueling approaches by modeling the probability of the winner of each duel by means of a Gaussian process model with a Bernoulli likelihood. The latent preference function is used to define a family of acquisition functions that extend usual policies used in BO. We illustrate the benefits of PBO in a variety of experiments, showing that PBO needs drastically fewer comparisons for finding the optimum. According to our experiments, the way of modeling correlations in PBO is key in obtaining this advantage.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes