ML | LG | Jun 14, 2022

Deep Variational Implicit Processes

arXiv:2206.06720v2 | 6 citations | h-index: 8
Originality: Incremental advance
AI Analysis

This work addresses the need for more flexible Bayesian nonparametric models with well-calibrated uncertainty, though it appears incremental because it extends the existing implicit process (IP) and deep Gaussian process (GP) frameworks.

The authors tackle the limited expressiveness of implicit processes (IPs) by proposing Deep Variational Implicit Processes (DVIP), a multi-layer generalization that outperforms previous IP-based methods and deep GPs in regression and classification experiments and scales to datasets with millions of instances.

Implicit processes (IPs) are a generalization of Gaussian processes (GPs). IPs may lack a closed-form expression but are easy to sample from. Examples include, among others, Bayesian neural networks and neural samplers. IPs can be used as priors over functions, resulting in flexible models with well-calibrated prediction uncertainty estimates. Methods based on IPs usually carry out function-space approximate inference, which overcomes some of the difficulties of parameter-space approximate inference. Nevertheless, the approximations employed often limit the expressiveness of the final model, resulting, e.g., in a Gaussian predictive distribution, which can be restrictive. We propose here a multi-layer generalization of IPs called the Deep Variational Implicit Process (DVIP). This generalization is similar to that of deep GPs over GPs, but it is more flexible due to the use of IPs as the prior distribution over the latent functions. We describe a scalable variational inference algorithm for training DVIP and show that it outperforms previous IP-based methods and also deep GPs. We support these claims via extensive regression and classification experiments. We also evaluate DVIP on large datasets with up to several million data instances to illustrate its good scalability and performance.
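
The abstract's two structural ideas, that an IP prior is easy to sample from even without a closed form, and that such priors can be stacked in layers, can be illustrated with a minimal NumPy sketch. It covers prior sampling only and is not the paper's variational inference algorithm. The function names (`sample_bnn_layer`, `sample_deep_prior`), the one-hidden-layer tanh network, the standard normal weight priors, and the hidden width of 50 are all assumptions made for illustration.

```python
import numpy as np


def sample_bnn_layer(inputs, rng, hidden=50):
    """Draw one function sample per row of `inputs` from a BNN prior.

    Hypothetical helper: a one-hidden-layer tanh network whose weights are
    drawn from standard normal priors (an assumption, not the paper's setup).
    inputs: array of shape (S, N, D), S function samples at N points.
    Returns an array of shape (S, N, 1).
    """
    s, _, d = inputs.shape
    w1 = rng.normal(size=(s, d, hidden))
    b1 = rng.normal(size=(s, 1, hidden))
    # Scale output weights so the output variance does not grow with width.
    w2 = rng.normal(size=(s, hidden, 1)) / np.sqrt(hidden)
    b2 = rng.normal(size=(s, 1, 1))
    return np.tanh(inputs @ w1 + b1) @ w2 + b2


def sample_deep_prior(x, n_samples, n_layers=2, rng=None):
    """Compose independent BNN-prior layers, feeding each layer's output
    to the next, in the same spirit as a deep GP stacks GP layers.

    x: array of shape (N, D). Returns an array of shape (n_samples, N, 1).
    """
    rng = np.random.default_rng() if rng is None else rng
    out = np.broadcast_to(x[None], (n_samples,) + x.shape)
    for _ in range(n_layers):
        out = sample_bnn_layer(out, rng)
    return out


if __name__ == "__main__":
    x = np.linspace(-3.0, 3.0, 100).reshape(-1, 1)
    samples = sample_deep_prior(x, n_samples=5, n_layers=2,
                                rng=np.random.default_rng(0))
    print(samples.shape)
```

Each call draws fresh weights per sample, so every slice along the first axis is one function drawn from the composed prior. No density over functions is ever evaluated, which is the sense in which the prior is implicit.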

Code Implementations: 1 repo
