ML LGMay 28, 2019

Recursive Estimation for Sparse Gaussian Process Regression

Manuel Schürch, Dario Azzimonti, Alessio Benavoli, Marco Zaffalon

arXiv:1905.11711v211.845 citationsHas Code

Originality Highly original

AI Analysis

This work addresses the computational bottleneck in Gaussian Process regression for large datasets, enabling more efficient online learning in applications like time-series analysis or robotics.

The paper tackles the scalability of Gaussian Process regression by connecting sparse inducing point methods to Bayesian recursive estimation, enabling online learning with mini-batches. It achieves faster convergence and superior performance on datasets with up to a million samples compared to state-of-the-art sequential methods.

Gaussian Processes (GPs) are powerful kernelized methods for non-parameteric regression used in many applications. However, their use is limited to a few thousand of training samples due to their cubic time complexity. In order to scale GPs to larger datasets, several sparse approximations based on so-called inducing points have been proposed in the literature. In this work we investigate the connection between a general class of sparse inducing point GP regression methods and Bayesian recursive estimation which enables Kalman Filter like updating for online learning. The majority of previous work has focused on the batch setting, in particular for learning the model parameters and the position of the inducing points, here instead we focus on training with mini-batches. By exploiting the Kalman filter formulation, we propose a novel approach that estimates such parameters by recursively propagating the analytical gradients of the posterior over mini-batches of the data. Compared to state of the art methods, our method keeps analytic updates for the mean and covariance of the posterior, thus reducing drastically the size of the optimization problem. We show that our method achieves faster convergence and superior performance compared to state of the art sequential Gaussian Process regression on synthetic GP as well as real-world data with up to a million of data samples.

View on arXiv PDF Code

Similar