MLLGJul 5, 2018

Scalable Gaussian Processes with Grid-Structured Eigenfunctions (GP-GRIEF)

arXiv:1807.02125v227 citations
Originality Incremental advance
AI Analysis

This work addresses scalability issues in Gaussian processes for large datasets, though it is incremental as it builds on existing kernel approximation methods.

The authors tackled the computational bottleneck of Gaussian process training by introducing a kernel approximation strategy that reduces time complexity to O(p), enabling Bayesian inference on large-scale datasets with up to two million training points and 10^33 inducing points.

We introduce a kernel approximation strategy that enables computation of the Gaussian process log marginal likelihood and all hyperparameter derivatives in $\mathcal{O}(p)$ time. Our GRIEF kernel consists of $p$ eigenfunctions found using a Nystrom approximation from a dense Cartesian product grid of inducing points. By exploiting algebraic properties of Kronecker and Khatri-Rao tensor products, computational complexity of the training procedure can be practically independent of the number of inducing points. This allows us to use arbitrarily many inducing points to achieve a globally accurate kernel approximation, even in high-dimensional problems. The fast likelihood evaluation enables type-I or II Bayesian inference on large-scale datasets. We benchmark our algorithms on real-world problems with up to two-million training points and $10^{33}$ inducing points.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes