LG MLDec 12, 2017

Practical Bayesian optimization in the presence of outliers

Ruben Martinez-Cantin, Kevin Tee, Michael McCourt

arXiv:1712.04567v114.662 citations

Originality Incremental advance

AI Analysis

This addresses a practical issue in Bayesian optimization for researchers and practitioners dealing with noisy or outlier-prone data, though it is incremental as it builds on existing robust regression methods.

The paper tackles the problem of Bayesian optimization being biased by outliers in data, and proposes a new algorithm combining robust regression with outlier diagnostics and a scheduler, which improves efficiency and convergence over standard robust regression.

Inference in the presence of outliers is an important field of research as outliers are ubiquitous and may arise across a variety of problems and domains. Bayesian optimization is method that heavily relies on probabilistic inference. This allows outstanding sample efficiency because the probabilistic machinery provides a memory of the whole optimization process. However, that virtue becomes a disadvantage when the memory is populated with outliers, inducing bias in the estimation. In this paper, we present an empirical evaluation of Bayesian optimization methods in the presence of outliers. The empirical evidence shows that Bayesian optimization with robust regression often produces suboptimal results. We then propose a new algorithm which combines robust regression (a Gaussian process with Student-t likelihood) with outlier diagnostics to classify data points as outliers or inliers. By using an scheduler for the classification of outliers, our method is more efficient and has better convergence over the standard robust regression. Furthermore, we show that even in controlled situations with no expected outliers, our method is able to produce better results.

View on arXiv PDF

Similar