CV · LG · ML · Dec 4, 2020

Encoding the latent posterior of Bayesian Neural Networks for uncertainty quantification

arXiv:2012.02818v2 · 37 citations
AI Analysis

This work aims to make Bayesian Neural Networks more efficient and scalable for researchers and practitioners working on complex computer vision problems, offering an alternative to computationally heavy methods like Deep Ensembles.

This paper addresses the scalability issues of Bayesian Neural Networks (BNNs) for uncertainty quantification in complex computer vision tasks. The authors propose Latent-Posterior BNN (LP-BNN), which uses variational autoencoders to learn the latent distribution of network parameters, achieving competitive results in image classification, semantic segmentation, and out-of-distribution detection.

Bayesian neural networks (BNNs) have long been considered an ideal, yet unscalable, solution for improving the robustness and the predictive uncertainty of deep neural networks. While they could capture the posterior distribution of the network parameters more accurately, most BNN approaches are either limited to small networks or rely on constraining assumptions such as parameter independence. These drawbacks have enabled the prominence of simple but computationally heavy approaches such as Deep Ensembles, whose training and testing costs increase linearly with the number of networks. In this work we aim for efficient deep BNNs amenable to complex computer vision architectures, e.g., ResNet50 DeepLabV3+, and tasks, e.g., semantic segmentation, with fewer assumptions on the parameters. We achieve this by leveraging variational autoencoders (VAEs) to learn the interaction and the latent distribution of the parameters at each network layer. Our approach, Latent-Posterior BNN (LP-BNN), is compatible with the recent BatchEnsemble method, leading to highly efficient (in terms of computation and memory during both training and testing) ensembles. LP-BNNs attain competitive results across multiple metrics in several challenging benchmarks for image classification, semantic segmentation and out-of-distribution detection.
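The efficiency argument in the abstract can be made concrete with a small sketch. It is not the authors' implementation. It assumes the rank-1 parameterisation associated with BatchEnsemble, where each ensemble member reuses one shared weight matrix and owns only two small vectors, and it assumes, purely for illustration, that a member's vector is produced by decoding a latent sample through a linear decoder. All function names are hypothetical, and biases are ignored in the parameter counts.

```python
import math
import random


def deep_ensemble_params(n_in, n_out, members):
    """Weights for one dense layer when every member stores a full matrix."""
    return members * n_in * n_out


def batch_ensemble_params(n_in, n_out, members):
    """Weights for one dense layer with a shared matrix plus, per member,
    one vector of length n_in and one of length n_out (rank-1 factors)."""
    return n_in * n_out + members * (n_in + n_out)


def sample_fast_weight(mu, log_var, decoder, rng):
    """Draw z = mu + sigma * eps (reparameterisation), then decode it.

    `decoder` is a hypothetical linear map given as a list of rows; the
    decoder used in the paper may differ.
    """
    z = [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
         for m, lv in zip(mu, log_var)]
    return [sum(w * zk for w, zk in zip(row, z)) for row in decoder]


def rank1_linear(x, shared_w, r, s):
    """Compute y_j = s_j * sum_i x_i * r_i * W[i][j].

    This equals multiplying x by the member matrix W * (r s^T) elementwise,
    without ever materialising that matrix.
    """
    n_out = len(shared_w[0])
    scaled = [xi * ri for xi, ri in zip(x, r)]
    return [s[j] * sum(scaled[i] * shared_w[i][j] for i in range(len(x)))
            for j in range(n_out)]


if __name__ == "__main__":
    rng = random.Random(0)
    # Memory for a 512x512 layer with 4 members: full copies vs. rank-1 factors.
    print(deep_ensemble_params(512, 512, 4), batch_ensemble_params(512, 512, 4))
    # One stochastic forward pass through a 2x2 layer with a sampled vector r.
    decoder = [[1.0, 0.0], [0.0, 1.0]]
    r = sample_fast_weight([1.0, 1.0], [-4.0, -4.0], decoder, rng)
    print(rank1_linear([1.0, 2.0], [[1.0, 0.0], [0.0, 1.0]], r, [2.0, 3.0]))
```

Under these assumptions, the full-copy count grows with the product of layer size and member count, while the rank-1 count adds only `n_in + n_out` values per extra member, which is the kind of saving the abstract refers to when it contrasts LP-BNN with Deep Ensembles.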
