LG AI MLNov 29, 2018

The Relevance of Bayesian Layer Positioning to Model Uncertainty in Deep Bayesian Active Learning

Jiaming Zeng, Adam Lesnikowski, Jose M. Alvarez

arXiv:1811.12535v19.831 citations

Originality Incremental advance

AI Analysis

This addresses the computational inefficiency of Bayesian neural networks for deep learning practitioners, offering a more efficient approach to uncertainty estimation, though it is incremental as it builds on existing Bayesian methods.

The study investigated whether fully Bayesian networks are necessary for capturing model uncertainty by varying the number and position of Bayesian layers in a network, finding that only a few Bayesian layers near the output can fully capture uncertainty while combining deterministic and Bayesian advantages.

One of the main challenges of deep learning tools is their inability to capture model uncertainty. While Bayesian deep learning can be used to tackle the problem, Bayesian neural networks often require more time and computational power to train than deterministic networks. Our work explores whether fully Bayesian networks are needed to successfully capture model uncertainty. We vary the number and position of Bayesian layers in a network and compare their performance on active learning with the MNIST dataset. We found that we can fully capture the model uncertainty by using only a few Bayesian layers near the output of the network, combining the advantages of deterministic and Bayesian networks.

View on arXiv PDF

Similar