LG ML | Mar 15, 2024

Fast and reliable uncertainty quantification with neural network ensembles for industrial image classification

arXiv:2403.10182v57.99 citations | h-index: 23 | Ann Oper Res

Originality: Incremental advance

AI Analysis

This work addresses reliability issues for industrial applications where models encounter unknown objects, though it is incremental as it builds on existing ensemble techniques.

The study addresses unreliable neural network predictions on out-of-distribution data in industrial image classification by comparing efficient ensemble methods. It finds that the batch ensemble matches the deep ensemble in accuracy and uncertainty quantification while considerably reducing training time, test time, and memory storage.

Image classification with neural networks (NNs) is widely used in industrial processes, where the model is likely to encounter unknown objects during deployment, i.e., out-of-distribution (OOD) data. Worryingly, NNs tend to make confident yet incorrect predictions when confronted with OOD data. To increase their reliability, models should quantify the uncertainty in their own predictions, communicating when the output should (not) be trusted. Deep ensembles, composed of multiple independent NNs, have been shown to perform strongly but are computationally expensive. Recent research has proposed more efficient NN ensembles, namely the snapshot, batch, and multi-input multi-output ensembles. This study investigates the predictive and uncertainty performance of efficient NN ensembles in the context of image classification for industrial processes. It is the first to provide a comprehensive comparison, and it proposes a novel Diversity Quality metric to quantify the ensembles' performance on the in-distribution and OOD sets in a single metric. The results highlight the batch ensemble as a cost-effective and competitive alternative to the deep ensemble. It matches the deep ensemble in both uncertainty and accuracy while exhibiting considerable savings in training time, test time, and memory storage.
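
As a rough illustration of how an ensemble can signal when its output should not be trusted, the sketch below averages the class probabilities of several members and scores the result with predictive entropy. This is a common generic recipe, not the paper's method or its Diversity Quality metric; the function names and the example probabilities are hypothetical and chosen only for illustration.

```python
import math


def ensemble_predict(member_probs):
    """Average per-member class probabilities for a single image.

    member_probs: list of M lists, each holding K class probabilities
    (assumed to be softmax outputs of M ensemble members).
    """
    n_members = len(member_probs)
    n_classes = len(member_probs[0])
    return [
        sum(p[k] for p in member_probs) / n_members
        for k in range(n_classes)
    ]


def predictive_entropy(probs):
    """Shannon entropy (in nats) of a probability vector; higher means less certain."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


# Hypothetical outputs of a 3-member ensemble on two inputs (made-up numbers).
agree = [[0.90, 0.05, 0.05], [0.92, 0.04, 0.04], [0.88, 0.06, 0.06]]
disagree = [[0.90, 0.05, 0.05], [0.05, 0.90, 0.05], [0.05, 0.05, 0.90]]

h_agree = predictive_entropy(ensemble_predict(agree))
h_disagree = predictive_entropy(ensemble_predict(disagree))
```

When the members agree, the averaged distribution stays peaked and the entropy is low. When each member confidently picks a different class, the average flattens and the entropy approaches its maximum, log K. Disagreement of this kind is one signal that an input may be out-of-distribution.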

