LGAIMLNov 29, 2018

The Relevance of Bayesian Layer Positioning to Model Uncertainty in Deep Bayesian Active Learning

arXiv:1811.12535v131 citations
Originality Incremental advance
AI Analysis

This addresses the computational inefficiency of Bayesian neural networks for deep learning practitioners, offering a more efficient approach to uncertainty estimation, though it is incremental as it builds on existing Bayesian methods.

The study investigated whether fully Bayesian networks are necessary for capturing model uncertainty by varying the number and position of Bayesian layers in a network, finding that only a few Bayesian layers near the output can fully capture uncertainty while combining deterministic and Bayesian advantages.

One of the main challenges of deep learning tools is their inability to capture model uncertainty. While Bayesian deep learning can be used to tackle the problem, Bayesian neural networks often require more time and computational power to train than deterministic networks. Our work explores whether fully Bayesian networks are needed to successfully capture model uncertainty. We vary the number and position of Bayesian layers in a network and compare their performance on active learning with the MNIST dataset. We found that we can fully capture the model uncertainty by using only a few Bayesian layers near the output of the network, combining the advantages of deterministic and Bayesian networks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes