PL LGJan 9, 2023

Fast and Correct Gradient-Based Optimisation for Probabilistic Programming via Smoothing

Basim Khajwal, C. -H. Luke Ong, Dominik Wagner

arXiv:2301.03415v12.34 citationsh-index: 28

Originality Highly original

AI Analysis

This addresses a foundational issue in variational inference for probabilistic programming, enabling reliable and efficient optimization in the presence of discontinuities.

The paper tackles the problem of ensuring correctness in gradient-based optimization for probabilistic programming with discontinuities, by introducing a smoothed semantics and type systems to prove correctness, achieving orders of magnitude reduction in work-normalised variance.

We study the foundations of variational inference, which frames posterior inference as an optimisation problem, for probabilistic programming. The dominant approach for optimisation in practice is stochastic gradient descent. In particular, a variant using the so-called reparameterisation gradient estimator exhibits fast convergence in a traditional statistics setting. Unfortunately, discontinuities, which are readily expressible in programming languages, can compromise the correctness of this approach. We consider a simple (higher-order, probabilistic) programming language with conditionals, and we endow our language with both a measurable and a smoothed (approximate) value semantics. We present type systems which establish technical pre-conditions. Thus we can prove stochastic gradient descent with the reparameterisation gradient estimator to be correct when applied to the smoothed problem. Besides, we can solve the original problem up to any error tolerance by choosing an accuracy coefficient suitably. Empirically we demonstrate that our approach has a similar convergence as a key competitor, but is simpler, faster, and attains orders of magnitude reduction in work-normalised variance.

View on arXiv PDF

Similar