LGMLJul 4, 2018

Discrete Sampling using Semigradient-based Product Mixtures

arXiv:1807.01808v22 citations
AI Analysis

This addresses the problem of inefficient inference in discrete models like determinantal point processes and Ising models for researchers and practitioners in machine learning, representing an incremental improvement over existing methods.

The paper tackles the slow convergence of Markov chain Monte Carlo algorithms for inference in discrete probabilistic models by proposing a novel sampling strategy that uses a mixture of product distributions to propose global moves, accelerating convergence, with practical demonstrations on real-world datasets.

We consider the problem of inference in discrete probabilistic models, that is, distributions over subsets of a finite ground set. These encompass a range of well-known models in machine learning, such as determinantal point processes and Ising models. Locally-moving Markov chain Monte Carlo algorithms, such as the Gibbs sampler, are commonly used for inference in such models, but their convergence is, at times, prohibitively slow. This is often caused by state-space bottlenecks that greatly hinder the movement of such samplers. We propose a novel sampling strategy that uses a specific mixture of product distributions to propose global moves and, thus, accelerate convergence. Furthermore, we show how to construct such a mixture using semigradient information. We illustrate the effectiveness of combining our sampler with existing ones, both theoretically on an example model, as well as practically on three models learned from real-world data sets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes