LGMay 20, 2022

Bayesian Active Learning with Fully Bayesian Gaussian Processes

Christoffer Riis, Francisco Antunes, Frederik Boe Hüttel, Carlos Lima Azevedo, Francisco Câmara Pereira

arXiv:2205.10186v318.141 citationsh-index: 16Has Code

Originality Incremental advance

AI Analysis

This work addresses inefficient querying in active learning for scenarios with scarce labeled data, representing an incremental improvement in method design.

The paper tackled the bias-variance trade-off in active learning with Gaussian Processes by designing two new acquisition functions, B-QBC and QB-MGP, which improved marginal likelihood and predictive performance across six simulators.

The bias-variance trade-off is a well-known problem in machine learning that only gets more pronounced the less available data there is. In active learning, where labeled data is scarce or difficult to obtain, neglecting this trade-off can cause inefficient and non-optimal querying, leading to unnecessary data labeling. In this paper, we focus on active learning with Gaussian Processes (GPs). For the GP, the bias-variance trade-off is made by optimization of the two hyperparameters: the length scale and noise-term. Considering that the optimal mode of the joint posterior of the hyperparameters is equivalent to the optimal bias-variance trade-off, we approximate this joint posterior and utilize it to design two new acquisition functions. The first one is a Bayesian variant of Query-by-Committee (B-QBC), and the second is an extension that explicitly minimizes the predictive variance through a Query by Mixture of Gaussian Processes (QB-MGP) formulation. Across six simulators, we empirically show that B-QBC, on average, achieves the best marginal likelihood, whereas QB-MGP achieves the best predictive performance. We show that incorporating the bias-variance trade-off in the acquisition functions mitigates unnecessary and expensive data labeling.

View on arXiv PDF Code

Similar