MLAILGDSMay 21, 2019

Robustness Against Outliers For Deep Neural Networks By Gradient Conjugate Priors

arXiv:1905.08464v1
Originality Highly original
AI Analysis

This addresses robustness against outliers for deep neural networks in regression tasks, offering a novel correction method with theoretical guarantees.

The paper tackles the problem of robust probability distribution reconstruction in the presence of output outliers by proposing a gradient conjugate prior (GCP) network, showing that the fitted mean is within a c e^{-1/ε}-neighborhood and corrected variance within a bε-neighborhood of the ground truth, with experiments indicating it outperforms other robust regression methods.

We analyze a new robust method for the reconstruction of probability distributions of observed data in the presence of output outliers. It is based on a so-called gradient conjugate prior (GCP) network which outputs the parameters of a prior. By rigorously studying the dynamics of the GCP learning process, we derive an explicit formula for correcting the obtained variance of the marginal distribution and removing the bias caused by outliers in the training set. Assuming a Gaussian (input-dependent) ground truth distribution contaminated with a proportion $\varepsilon$ of outliers, we show that the fitted mean is in a $c e^{-1/\varepsilon}$-neighborhood of the ground truth mean and the corrected variance is in a $b\varepsilon$-neighborhood of the ground truth variance, whereas the uncorrected variance of the marginal distribution can even be infinite. We explicitly find $b$ as a function of the output of the GCP network, without a priori knowledge of the outliers (possibly input-dependent) distribution. Experiments with synthetic and real-world data sets indicate that the GCP network fitted with a standard optimizer outperforms other robust methods for regression.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes