LGMLMar 13, 2024

Extracting Explanations, Justification, and Uncertainty from Black-Box Deep Neural Networks

arXiv:2403.08652v11 citationsh-index: 12Defense + Commercial Sensing
AI Analysis

This addresses the need for understanding DNN reasoning in mission-critical applications, though it appears incremental as it builds on existing black-box methods.

The paper tackles the problem of deep neural networks lacking task confidence and interpretability by proposing a novel Bayesian approach to extract explanations, justifications, and uncertainty estimates, showing it can significantly improve interpretability and reliability on the CIFAR-10 dataset.

Deep Neural Networks (DNNs) do not inherently compute or exhibit empirically-justified task confidence. In mission critical applications, it is important to both understand associated DNN reasoning and its supporting evidence. In this paper, we propose a novel Bayesian approach to extract explanations, justifications, and uncertainty estimates from DNNs. Our approach is efficient both in terms of memory and computation, and can be applied to any black box DNN without any retraining, including applications to anomaly detection and out-of-distribution detection tasks. We validate our approach on the CIFAR-10 dataset, and show that it can significantly improve the interpretability and reliability of DNNs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes