STLGMLMar 20, 2019

TATi-Thermodynamic Analytics ToolkIt: TensorFlow-based software for posterior sampling in machine learning applications

arXiv:1903.08640v25 citations
AI Analysis

This work provides a software tool for researchers and practitioners in machine learning to improve Bayesian inference, though it is incremental as it builds on existing sampling methods.

The authors tackled the challenge of Bayesian posterior sampling for neural networks by developing a TensorFlow-based software toolkit that employs ensemble quasi-Newton preconditioning to enhance sampling efficiency, demonstrating its application on the MNIST dataset with visualizations of the loss landscape.

With the advent of GPU-assisted hardware and maturing high-efficiency software platforms such as TensorFlow and PyTorch, Bayesian posterior sampling for neural networks becomes plausible. In this article we discuss Bayesian parametrization in machine learning based on Markov Chain Monte Carlo methods, specifically discretized stochastic differential equations such as Langevin dynamics and extended system methods in which an ensemble of walkers is employed to enhance sampling. We provide a glimpse of the potential of the sampling-intensive approach by studying (and visualizing) the loss landscape of a neural network applied to the MNIST data set. Moreover, we investigate how the sampling efficiency itself can be significantly enhanced through an ensemble quasi-Newton preconditioning method. This article accompanies the release of a new TensorFlow software package, the Thermodynamic Analytics ToolkIt, which is used in the computational experiments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes