MLLGNov 3, 2018

Large-scale Heteroscedastic Regression via Gaussian Process

arXiv:1811.01179v332 citations
Originality Incremental advance
AI Analysis

This addresses the problem of applying heteroscedastic regression to large datasets for researchers and practitioners in machine learning and statistics, representing an incremental improvement in scalability.

The paper tackles the scalability issue of heteroscedastic Gaussian process regression, which has cubic time complexity limiting big data applications, by developing variational sparse inference algorithms (VSHGP, SVSHGP, DVSHGP) that improve efficiency and performance, as verified on various datasets.

Heteroscedastic regression considering the varying noises among observations has many applications in the fields like machine learning and statistics. Here we focus on the heteroscedastic Gaussian process (HGP) regression which integrates the latent function and the noise function together in a unified non-parametric Bayesian framework. Though showing remarkable performance, HGP suffers from the cubic time complexity, which strictly limits its application to big data. To improve the scalability, we first develop a variational sparse inference algorithm, named VSHGP, to handle large-scale datasets. Furthermore, two variants are developed to improve the scalability and capability of VSHGP. The first is stochastic VSHGP (SVSHGP) which derives a factorized evidence lower bound, thus enhancing efficient stochastic variational inference. The second is distributed VSHGP (DVSHGP) which (i) follows the Bayesian committee machine formalism to distribute computations over multiple local VSHGP experts with many inducing points; and (ii) adopts hybrid parameters for experts to guard against over-fitting and capture local variety. The superiority of DVSHGP and SVSHGP as compared to existing scalable heteroscedastic/homoscedastic GPs is then extensively verified on various datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes