LG MLMar 11, 2019

Embarrassingly parallel MCMC using deep invertible transformations

Diego Mesquita, Paul Blomstedt, Samuel Kaski

arXiv:1903.04556v210.719 citations

Originality Highly original

AI Analysis

This addresses the problem of efficient Bayesian inference for large datasets, offering a novel aggregation strategy that is incremental but improves upon existing parallel MCMC methods.

The paper tackles the challenge of scaling MCMC to large distributed datasets by introducing a deep invertible transformation method to approximate subposteriors, enabling efficient aggregation with low communication costs and outperforming state-of-the-art methods in high-dimensional and heterogeneous scenarios.

While MCMC methods have become a main work-horse for Bayesian inference, scaling them to large distributed datasets is still a challenge. Embarrassingly parallel MCMC strategies take a divide-and-conquer stance to achieve this by writing the target posterior as a product of subposteriors, running MCMC for each of them in parallel and subsequently combining the results. The challenge then lies in devising efficient aggregation strategies. Current strategies trade-off between approximation quality, and costs of communication and computation. In this work, we introduce a novel method that addresses these issues simultaneously. Our key insight is to introduce a deep invertible transformation to approximate each of the subposteriors. These approximations can be made accurate even for complex distributions and serve as intermediate representations, keeping the total communication cost limited. Moreover, they enable us to sample from the product of the subposteriors using an efficient and stable importance sampling scheme. We demonstrate the approach outperforms available state-of-the-art methods in a range of challenging scenarios, including high-dimensional and heterogeneous subposteriors.

View on arXiv PDF

Similar