LG CVFeb 1, 2025

Explorations of the Softmax Space: Knowing When the Neural Network Doesn't Know

Daniel Sikar, Artur d'Avila Garcez, Tillman Weyde

arXiv:2502.00456v29.42 citationsh-index: 2

Originality Incremental advance

AI Analysis

This addresses the need for reliable automated decision-making in AI systems, particularly for critical situations, though it is incremental as it builds on existing softmax-based confidence methods.

The paper tackles the problem of measuring confidence in neural network predictions to improve reliability in critical applications, proposing a method based on clustering softmax outputs and distance thresholds to identify when predictions should be deferred as 'not known', with results showing consistency across MNIST and CIFAR-10 datasets using CNN and Vision Transformer models.

Ensuring the reliability of automated decision-making based on neural networks will be crucial as Artificial Intelligence systems are deployed more widely in critical situations. This paper proposes a new approach for measuring confidence in the predictions of any neural network that relies on the predictions of a softmax layer. We identify that a high-accuracy trained network may have certain outputs for which there should be low confidence. In such cases, decisions should be deferred and it is more appropriate for the network to provide a \textit{not known} answer to a corresponding classification task. Our approach clusters the vectors in the softmax layer to measure distances between cluster centroids and network outputs. We show that a cluster with centroid calculated simply as the mean softmax output for all correct predictions can serve as a suitable proxy in the evaluation of confidence. Defining a distance threshold for a class as the smallest distance from an incorrect prediction to the given class centroid offers a simple approach to adding \textit{not known} answers to any network classification falling outside of the threshold. We evaluate the approach on the MNIST and CIFAR-10 datasets using a Convolutional Neural Network and a Vision Transformer, respectively. The results show that our approach is consistent across datasets and network models, and indicate that the proposed distance metric can offer an efficient way of determining when automated predictions are acceptable and when they should be deferred to human operators.

View on arXiv PDF

Similar