MLLGMar 6, 2020

Sparse Gaussian Processes Revisited: Bayesian Approaches to Inducing-Variable Approximations

arXiv:2003.03080v413 citations
Originality Incremental advance
AI Analysis

This work addresses scalability and robustness issues in Gaussian process models for machine learning practitioners, offering an incremental improvement over existing variational methods.

The paper challenges the common practice of optimizing inducing inputs in variational inference for Gaussian processes, showing that a fully Bayesian treatment of both inducing locations and hyperparameters significantly improves performance. It demonstrates state-of-the-art results across regression and classification tasks using stochastic gradient Hamiltonian Monte Carlo.

Variational inference techniques based on inducing variables provide an elegant framework for scalable posterior estimation in Gaussian process (GP) models. Besides enabling scalability, one of their main advantages over sparse approximations using direct marginal likelihood maximization is that they provide a robust alternative for point estimation of the inducing inputs, i.e. the location of the inducing variables. In this work we challenge the common wisdom that optimizing the inducing inputs in the variational framework yields optimal performance. We show that, by revisiting old model approximations such as the fully-independent training conditionals endowed with powerful sampling-based inference methods, treating both inducing locations and GP hyper-parameters in a Bayesian way can improve performance significantly. Based on stochastic gradient Hamiltonian Monte Carlo, we develop a fully Bayesian approach to scalable GP and deep GP models, and demonstrate its state-of-the-art performance through an extensive experimental campaign across several regression and classification problems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes