LG MLMar 13, 2024

Extracting Explanations, Justification, and Uncertainty from Black-Box Deep Neural Networks

arXiv:2403.08652v12.61 citationsh-index: 12Defense + Commercial Sensing

Originality Incremental advance

AI Analysis

This addresses the need for understanding DNN reasoning in mission-critical applications, though it appears incremental as it builds on existing black-box methods.

The paper tackles the problem of deep neural networks lacking task confidence and interpretability by proposing a novel Bayesian approach to extract explanations, justifications, and uncertainty estimates, showing it can significantly improve interpretability and reliability on the CIFAR-10 dataset.

Deep Neural Networks (DNNs) do not inherently compute or exhibit empirically-justified task confidence. In mission critical applications, it is important to both understand associated DNN reasoning and its supporting evidence. In this paper, we propose a novel Bayesian approach to extract explanations, justifications, and uncertainty estimates from DNNs. Our approach is efficient both in terms of memory and computation, and can be applied to any black box DNN without any retraining, including applications to anomaly detection and out-of-distribution detection tasks. We validate our approach on the CIFAR-10 dataset, and show that it can significantly improve the interpretability and reliability of DNNs.

View on arXiv PDF

Similar