LG MLOct 8, 2021

Is MC Dropout Bayesian?

Loic Le Folgoc, Vasileios Baltatzis, Sujal Desai, Anand Devaraj, Sam Ellis, Octavio E. Martinez Manzanera, Arjun Nair, Huaqi Qiu, Julia Schnabel, Ben Glocker

arXiv:2110.04286v119.558 citations

Originality Incremental advance

AI Analysis

This work challenges a widely used method in medical imaging for uncertainty quantification, highlighting its flaws and offering a more robust alternative for researchers and practitioners.

The paper questions the Bayesian properties of MC Dropout, showing that it assigns zero probability to the true model on benchmarks and its multimodality is an artifact, and introduces a generic variational inference engine in PyTorch with structured multivariate normal families to address these issues.

MC Dropout is a mainstream "free lunch" method in medical imaging for approximate Bayesian computations (ABC). Its appeal is to solve out-of-the-box the daunting task of ABC and uncertainty quantification in Neural Networks (NNs); to fall within the variational inference (VI) framework; and to propose a highly multimodal, faithful predictive posterior. We question the properties of MC Dropout for approximate inference, as in fact MC Dropout changes the Bayesian model; its predictive posterior assigns $0$ probability to the true model on closed-form benchmarks; the multimodality of its predictive posterior is not a property of the true predictive posterior but a design artefact. To address the need for VI on arbitrary models, we share a generic VI engine within the pytorch framework. The code includes a carefully designed implementation of structured (diagonal plus low-rank) multivariate normal variational families, and mixtures thereof. It is intended as a go-to no-free-lunch approach, addressing shortcomings of mean-field VI with an adjustable trade-off between expressivity and computational complexity.

View on arXiv PDF

Similar