LGAIMLOct 24, 2020

Out-of-distribution detection for regression tasks: parameter versus predictor entropy

arXiv:2010.12995v2
Originality Incremental advance
AI Analysis

This addresses the problem of ensuring model trustworthiness in regression for machine learning practitioners, though it appears incremental as it builds on existing diversity-based OOD detection approaches.

The paper tackles out-of-distribution (OOD) detection in regression tasks by proposing a method to estimate predictor entropy using nearest neighbors in function space, which systematically generates diverse predictors and improves OOD detection robustness.

It is crucial to detect when an instance lies downright too far from the training samples for the machine learning model to be trusted, a challenge known as out-of-distribution (OOD) detection. For neural networks, one approach to this task consists of learning a diversity of predictors that all can explain the training data. This information can be used to estimate the epistemic uncertainty at a given newly observed instance in terms of a measure of the disagreement of the predictions. Evaluation and certification of the ability of a method to detect OOD require specifying instances which are likely to occur in deployment yet on which no prediction is available. Focusing on regression tasks, we choose a simple yet insightful model for this OOD distribution and conduct an empirical evaluation of the ability of various methods to discriminate OOD samples from the data. Moreover, we exhibit evidence that a diversity of parameters may fail to translate to a diversity of predictors. Based on the choice of an OOD distribution, we propose a new way of estimating the entropy of a distribution on predictors based on nearest neighbors in function space. This leads to a variational objective which, combined with the family of distributions given by a generative neural network, systematically produces a diversity of predictors that provides a robust way to detect OOD samples.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes