MLLGFeb 7, 2025

Tighter sparse variational Gaussian processes

arXiv:2502.04750v24 citationsh-index: 18Trans. Mach. Learn. Res.
AI Analysis

This provides an incremental improvement for scaling Gaussian processes to large datasets in machine learning applications.

The paper tackles the problem of improving sparse variational Gaussian process approximations by relaxing a standard conditional posterior assumption, resulting in a provably tighter bound that matches or outperforms standard methods on regression, classification, and latent variable models while maintaining computational efficiency.

Sparse variational Gaussian process (GP) approximations based on inducing points have become the de facto standard for scaling GPs to large datasets, owing to their theoretical elegance, computational efficiency, and ease of implementation. This paper introduces a provably tighter variational approximation by relaxing the standard assumption that the conditional approximate posterior given the inducing points must match that in the prior. The key innovation is to modify the conditional posterior to have smaller variances than that of the prior at the training points. We derive the collapsed bound for the regression case, describe how to use the proposed approximation in large data settings, and discuss its application to handle orthogonally structured inducing points and GP latent variable models. Extensive experiments on regression benchmarks, classification, and latent variable models demonstrate that the proposed approximation consistently matches or outperforms standard sparse variational GPs while maintaining the same computational cost. An implementation will be made available in all popular GP packages.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes