LG ME MLNov 25, 2021

Variational Gibbs Inference for Statistical Model Estimation from Incomplete Data

Vaidotas Simkus, Benjamin Rhodes, Michael U. Gutmann

arXiv:2111.13180v45.59 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses a critical issue in machine learning for practitioners dealing with real-world datasets plagued by missing data, offering a general-purpose solution that is incremental over existing variational inference methods.

The paper tackles the problem of estimating statistical models from incomplete data, where standard variational inference is intractable due to exponentially many conditional distributions, by introducing variational Gibbs inference (VGI), which achieves competitive or better performance than existing model-specific methods on synthetic and real-world tasks.

Statistical models are central to machine learning with broad applicability across a range of downstream tasks. The models are controlled by free parameters that are typically estimated from data by maximum-likelihood estimation or approximations thereof. However, when faced with real-world data sets many of the models run into a critical issue: they are formulated in terms of fully-observed data, whereas in practice the data sets are plagued with missing data. The theory of statistical model estimation from incomplete data is conceptually similar to the estimation of latent-variable models, where powerful tools such as variational inference (VI) exist. However, in contrast to standard latent-variable models, parameter estimation with incomplete data often requires estimating exponentially-many conditional distributions of the missing variables, hence making standard VI methods intractable. We address this gap by introducing variational Gibbs inference (VGI), a new general-purpose method to estimate the parameters of statistical models from incomplete data. We validate VGI on a set of synthetic and real-world estimation tasks, estimating important machine learning models such as variational autoencoders and normalising flows from incomplete data. The proposed method, whilst general-purpose, achieves competitive or better performance than existing model-specific estimation methods.

View on arXiv PDF Code

Similar