LGROMLJun 16, 2020

Real-Time Regression with Dividing Local Gaussian Processes

arXiv:2006.09446v213 citations
AI Analysis

This addresses the problem of enabling real-time regression for big data users, though it is an incremental improvement over existing Gaussian process methods.

The paper tackles the computational inefficiency of exact Gaussian process regression for large datasets in real-time applications by proposing dividing local Gaussian processes, achieving sublinear computational complexity and outperforming state-of-the-art methods in accuracy and speed.

The increased demand for online prediction and the growing availability of large data sets drives the need for computationally efficient models. While exact Gaussian process regression shows various favorable theoretical properties (uncertainty estimate, unlimited expressive power), the poor scaling with respect to the training set size prohibits its application in big data regimes in real-time. Therefore, this paper proposes dividing local Gaussian processes, which are a novel, computationally efficient modeling approach based on Gaussian process regression. Due to an iterative, data-driven division of the input space, they achieve a sublinear computational complexity in the total number of training points in practice, while providing excellent predictive distributions. A numerical evaluation on real-world data sets shows their advantages over other state-of-the-art methods in terms of accuracy as well as prediction and update speed.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes