ST DC LGMar 11, 2014

Robust and Scalable Bayes via a Median of Subset Posterior Measures

Stanislav Minsker, Sanvesh Srivastava, Lizhen Lin, David B. Dunson

arXiv:1403.2660v3114 citations

Originality Highly original

AI Analysis

This addresses the need for robust and scalable Bayesian methods in data analysis, offering a novel aggregation technique that improves upon standard approaches.

The paper tackles the problem of making Bayesian analysis robust to outliers and computationally efficient by splitting data into subgroups, computing posteriors for each, and aggregating them using a median in probability measure space, resulting in provable robustness and computational advantages.

We propose a novel approach to Bayesian analysis that is provably robust to outliers in the data and often has computational advantages over standard methods. Our technique is based on splitting the data into non-overlapping subgroups, evaluating the posterior distribution given each independent subgroup, and then combining the resulting measures. The main novelty of our approach is the proposed aggregation step, which is based on the evaluation of a median in the space of probability measures equipped with a suitable collection of distances that can be quickly and efficiently evaluated in practice. We present both theoretical and numerical evidence illustrating the improvements achieved by our method.

View on arXiv PDF

Similar