MLLGApr 27, 2023

Resampling Gradients Vanish in Differentiable Sequential Monte Carlo Samplers

arXiv:2304.14390v13 citationsh-index: 17
Originality Incremental advance
AI Analysis

This addresses a specific issue in differentiable sampling methods for researchers in machine learning, offering an incremental improvement.

The paper tackles the problem of low effective sample size in Differentiable Annealed Importance Sampling (DAIS) by proposing an extension with a resampling step inspired by Sequential Monte Carlo, and finds that differentiating through this step is unnecessary, avoiding gradient variance issues.

Annealed Importance Sampling (AIS) moves particles along a Markov chain from a tractable initial distribution to an intractable target distribution. The recently proposed Differentiable AIS (DAIS) (Geffner and Domke, 2021; Zhang et al., 2021) enables efficient optimization of the transition kernels of AIS and of the distributions. However, we observe a low effective sample size in DAIS, indicating degenerate distributions. We thus propose to extend DAIS by a resampling step inspired by Sequential Monte Carlo. Surprisingly, we find empirically-and can explain theoretically-that it is not necessary to differentiate through the resampling step which avoids gradient variance issues observed in similar approaches for Particle Filters (Maddison et al., 2017; Naesseth et al., 2018; Le et al., 2018).

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes