LGAIMLDec 11, 2023

An Ambiguity Measure for Recognizing the Unknowns in Deep Learning

arXiv:2312.06077v1h-index: 7
Originality Incremental advance
AI Analysis

This addresses reliability issues in deep learning for applications requiring safe deployment, though it is incremental as it builds on existing uncertainty and robustness methods.

The paper tackles the problem of deep neural networks making mistakes on seemingly trivial cases and failing to recognize unknown inputs, by proposing an ambiguity measure based on geometric arrangements in feature space to detect errors and ambiguous inputs, enabling models to abstain from classification and improve accuracy.

We study the understanding of deep neural networks from the scope in which they are trained on. While the accuracy of these models is usually impressive on the aggregate level, they still make mistakes, sometimes on cases that appear to be trivial. Moreover, these models are not reliable in realizing what they do not know leading to failures such as adversarial vulnerability and out-of-distribution failures. Here, we propose a measure for quantifying the ambiguity of inputs for any given model with regard to the scope of its training. We define the ambiguity based on the geometric arrangements of the decision boundaries and the convex hull of training set in the feature space learned by the trained model, and demonstrate that a single ambiguity measure may detect a considerable portion of mistakes of a model on in-distribution samples, adversarial inputs, as well as out-of-distribution inputs. Using our ambiguity measure, a model may abstain from classification when it encounters ambiguous inputs leading to a better model accuracy not just on a given testing set, but on the inputs it may encounter at the world at large. In pursuit of this measure, we develop a theoretical framework that can identify the unknowns of the model in relation to its scope. We put this in perspective with the confidence of the model and develop formulations to identify the regions of the domain which are unknown to the model, yet the model is guaranteed to have high confidence.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes