LGSTMLDec 18, 2019

Learning under Model Misspecification: Applications to Variational and Ensemble methods

arXiv:1912.08335v539 citations
Originality Highly original
AI Analysis

This addresses the problem of model misspecification in machine learning, offering theoretical insights and practical algorithms for improved predictive performance, though it is incremental in building on existing PAC-Bayes bounds.

The paper analyzes Bayesian model averaging under model misspecification, showing it provides suboptimal generalization performance, and introduces new Bayesian-like algorithms as variational and ensemble methods that improve generalization.

Virtually any model we use in machine learning to make predictions does not perfectly represent reality. So, most of the learning happens under model misspecification. In this work, we present a novel analysis of the generalization performance of Bayesian model averaging under model misspecification and i.i.d. data using a new family of second-order PAC-Bayes bounds. This analysis shows, in simple and intuitive terms, that Bayesian model averaging provides suboptimal generalization performance when the model is misspecified. In consequence, we provide strong theoretical arguments showing that Bayesian methods are not optimal for learning predictive models, unless the model class is perfectly specified. Using novel second-order PAC-Bayes bounds, we derive a new family of Bayesian-like algorithms, which can be implemented as variational and ensemble methods. The output of these algorithms is a new posterior distribution, different from the Bayesian posterior, which induces a posterior predictive distribution with better generalization performance. Experiments with Bayesian neural networks illustrate these findings.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes