MLLGJun 4, 2020

Quadruply Stochastic Gaussian Processes

arXiv:2006.03015v14 citations
Originality Highly original
AI Analysis

This enables scalable inference for Gaussian processes on massive datasets, which is incremental but addresses a known bottleneck in machine learning.

The paper tackles the problem of scaling Gaussian process models to huge datasets by introducing a stochastic variational inference procedure with per-iteration complexity independent of the number of training points and basis functions, achieving accurate inference on large datasets with up to 10^7 basis functions.

We introduce a stochastic variational inference procedure for training scalable Gaussian process (GP) models whose per-iteration complexity is independent of both the number of training points, $n$, and the number basis functions used in the kernel approximation, $m$. Our central contributions include an unbiased stochastic estimator of the evidence lower bound (ELBO) for a Gaussian likelihood, as well as a stochastic estimator that lower bounds the ELBO for several other likelihoods such as Laplace and logistic. Independence of the stochastic optimization update complexity on $n$ and $m$ enables inference on huge datasets using large capacity GP models. We demonstrate accurate inference on large classification and regression datasets using GPs and relevance vector machines with up to $m = 10^7$ basis functions.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes