LGMEMLFeb 12, 2025

New Bounds for Sparse Variational Gaussian Processes

arXiv:2502.08730v22 citationsh-index: 34ICML
AI Analysis

This work addresses bias issues in sparse GP approximations, offering an incremental improvement for practitioners in machine learning and statistics.

The paper tackles the problem of bias in sparse variational Gaussian processes by relaxing a fundamental assumption in the variational distribution, introducing extra parameters that allow for a tighter evidence lower bound. This results in improved predictive performance and reduced hyperparameter bias, as demonstrated on several datasets.

Sparse variational Gaussian processes (GPs) construct tractable posterior approximations to GP models. At the core of these methods is the assumption that the true posterior distribution over training function values ${\bf f}$ and inducing variables ${\bf u}$ is approximated by a variational distribution that incorporates the conditional GP prior $p({\bf f} | {\bf u})$ in its factorization. While this assumption is considered as fundamental, we show that for model training we can relax it through the use of a more general variational distribution $q({\bf f} | {\bf u})$ that depends on $N$ extra parameters, where $N$ is the number of training examples. In GP regression, we can analytically optimize the evidence lower bound over the extra parameters and express a tractable collapsed bound that is tighter than the previous bound. The new bound is also amenable to stochastic optimization and its implementation requires minor modifications to existing sparse GP code. Further, we also describe extensions to non-Gaussian likelihoods. On several datasets we demonstrate that our method can reduce bias when learning the hyperparameters and can lead to better predictive performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes