NAOct 4, 2012
On the convergence of the Escalator Boxcar TrainÅke Brännström, Linus Carlsson, Daniel Simpson
The Escalator Boxcar Train (EBT) is a numerical method that is widely used in theoretical biology to investigate the dynamics of physiologically structured population models, i.e., models in which individuals differ by size or other physiological characteristics. The method was developed more than two decades ago, but has so far resisted attempts to give a formal proof of convergence. Using a modern framework of measure-valued solutions, we investigate the EBT method and show that the sequence of approximating solution measures generated by the EBT method converges weakly to the true solution measure under weak conditions on the growth rate, birth rate, and mortality rate. In rigorously establishing the convergence of the EBT method, our results pave the way for wider acceptance of the EBT method beyond theoretical biology and constitutes an important step towards integration with established numerical schemes.
MLFeb 7, 2018
Yes, but Did It Work?: Evaluating Variational InferenceYuling Yao, Aki Vehtari, Daniel Simpson et al.
While it's always possible to compute a variational approximation to a posterior distribution, it can be difficult to discover problems with this approximation. We propose two diagnostic algorithms to alleviate this problem. The Pareto-smoothed importance sampling (PSIS) diagnostic gives a goodness of fit measurement for joint distributions, while simultaneously improving the error in the estimate. The variational simulation-based calibration (VSBC) assesses the average performance of point estimates.
COJul 9, 2015
Pareto Smoothed Importance SamplingAki Vehtari, Daniel Simpson, Andrew Gelman et al.
Importance weighting is a general way to adjust Monte Carlo integration to account for draws from the wrong distribution, but the resulting estimate can be highly variable when the importance ratios have a heavy right tail. This routinely occurs when there are aspects of the target distribution that are not well captured by the approximating distribution, in which case more stable estimates can be obtained by modifying extreme importance ratios. We present a new method for stabilizing importance weights using a generalized Pareto distribution fit to the upper tail of the distribution of the simulated importance ratios. The method, which empirically performs better than existing methods for stabilizing importance sampling estimates, includes stabilized effective sample size estimates, Monte Carlo error estimates, and convergence diagnostics. The presented Pareto $\hat{k}$ finite sample convergence rate diagnostic is useful for any Monte Carlo estimator.
MEJun 17, 2013
On Russian Roulette Estimates for Bayesian Inference with Doubly-Intractable LikelihoodsAnne-Marie Lyne, Mark Girolami, Yves Atchadé et al.
A large number of statistical models are "doubly-intractable": the likelihood normalising term, which is a function of the model parameters, is intractable, as well as the marginal likelihood (model evidence). This means that standard inference techniques to sample from the posterior, such as Markov chain Monte Carlo (MCMC), cannot be used. Examples include, but are not confined to, massive Gaussian Markov random fields, autologistic models and Exponential random graph models. A number of approximate schemes based on MCMC techniques, Approximate Bayesian computation (ABC) or analytic approximations to the posterior have been suggested, and these are reviewed here. Exact MCMC schemes, which can be applied to a subset of doubly-intractable distributions, have also been developed and are described in this paper. As yet, no general method exists which can be applied to all classes of models with doubly-intractable posteriors. In addition, taking inspiration from the Physics literature, we study an alternative method based on representing the intractable likelihood as an infinite series. Unbiased estimates of the likelihood can then be obtained by finite time stochastic truncation of the series via Russian Roulette sampling, although the estimates are not necessarily positive. Results from the Quantum Chromodynamics literature are exploited to allow the use of possibly negative estimates in a pseudo-marginal MCMC scheme such that expectations with respect to the posterior distribution are preserved. The methodology is reviewed on well-known examples such as the parameters in Ising models, the posterior for Fisher-Bingham distributions on the $d$-Sphere and a large-scale Gaussian Markov Random Field model describing the Ozone Column data. This leads to a critical assessment of the strengths and weaknesses of the methodology with pointers to ongoing research.