MLJun 22, 2017

Scalable Multi-Class Gaussian Process Classification using Expectation Propagation

arXiv:1706.07258v120 citations
Originality Incremental advance
AI Analysis

This work addresses scalability issues in multi-class Gaussian process classification for large datasets, representing an incremental improvement over existing approximation methods.

The paper tackles the challenge of scaling Gaussian process classification to large datasets by proposing an expectation propagation method that enables efficient training with stochastic gradients and mini-batches, achieving computational and memory costs independent of dataset size and performing similarly or better than variational inference alternatives in terms of test log-likelihood and convergence speed.

This paper describes an expectation propagation (EP) method for multi-class classification with Gaussian processes that scales well to very large datasets. In such a method the estimate of the log-marginal-likelihood involves a sum across the data instances. This enables efficient training using stochastic gradients and mini-batches. When this type of training is used, the computational cost does not depend on the number of data instances $N$. Furthermore, extra assumptions in the approximate inference process make the memory cost independent of $N$. The consequence is that the proposed EP method can be used on datasets with millions of instances. We compare empirically this method with alternative approaches that approximate the required computations using variational inference. The results show that it performs similar or even better than these techniques, which sometimes give significantly worse predictive distributions in terms of the test log-likelihood. Besides this, the training process of the proposed approach also seems to converge in a smaller number of iterations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes