LGMLOct 10, 2019

Information Aware Max-Norm Dirichlet Networks for Predictive Uncertainty Estimation

arXiv:1910.04819v411 citations
Originality Highly original
AI Analysis

This addresses the critical need for precise uncertainty estimation in AI systems to ensure trust and safety, representing a novel method rather than an incremental improvement.

The paper tackles the problem of overconfident predictions in deep neural networks by proposing Information Aware Dirichlet networks, which learn a Dirichlet prior on predictive distributions to improve uncertainty estimation, achieving large-margin improvements over state-of-the-art methods on real datasets for within-distribution, out-of-distribution uncertainty, and adversarial example detection.

Precise estimation of uncertainty in predictions for AI systems is a critical factor in ensuring trust and safety. Deep neural networks trained with a conventional method are prone to over-confident predictions. In contrast to Bayesian neural networks that learn approximate distributions on weights to infer prediction confidence, we propose a novel method, Information Aware Dirichlet networks, that learn an explicit Dirichlet prior distribution on predictive distributions by minimizing a bound on the expected max norm of the prediction error and penalizing information associated with incorrect outcomes. Properties of the new cost function are derived to indicate how improved uncertainty estimation is achieved. Experiments using real datasets show that our technique outperforms, by a large margin, state-of-the-art neural networks for estimating within-distribution and out-of-distribution uncertainty, and detecting adversarial examples.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes