LGAIOct 5, 2020

Detecting Misclassification Errors in Neural Networks with a Gaussian Process Model

arXiv:2010.02065v413 citations
AI Analysis

This addresses the practical need for reliable error detection in deployed neural network classifiers, though it appears to be an incremental improvement over existing confidence-based methods.

The paper tackles the problem of detecting misclassification errors in neural networks by developing a new framework called RED that uses Gaussian Processes to estimate uncertainty in detection scores. Experimental results on 125 UCI datasets and vision tasks show the method is effective, robust, and scalable.

As neural network classifiers are deployed in real-world applications, it is crucial that their failures can be detected reliably. One practical solution is to assign confidence scores to each prediction, then use these scores to filter out possible misclassifications. However, existing confidence metrics are not yet sufficiently reliable for this role. This paper presents a new framework that produces a quantitative metric for detecting misclassification errors. This framework, RED, builds an error detector on top of the base classifier and estimates uncertainty of the detection scores using Gaussian Processes. Experimental comparisons with other error detection methods on 125 UCI datasets demonstrate that this approach is effective. Further implementations on two probabilistic base classifiers and two large deep learning architecture in vision tasks further confirm that the method is robust and scalable. Third, an empirical analysis of RED with out-of-distribution and adversarial samples shows that the method can be used not only to detect errors but also to understand where they come from. RED can thereby be used to improve trustworthiness of neural network classifiers more broadly in the future.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes