LGMLDec 4, 2019

Regression with Uncertainty Quantification in Large Scale Complex Data

arXiv:1912.02163v1
Originality Incremental advance
AI Analysis

It addresses the problem of scalable uncertainty estimation for practitioners dealing with noisy or volatile data, though it is incremental as it builds on existing methods.

The paper tackles uncertainty quantification in regression for large, complex datasets by proposing a simplified Mixture Density Networks approach, showing improvements in predictive log-likelihood and root-mean-square-error on benchmarks and applying it to stock price prediction and age estimation from face images.

While several methods for predicting uncertainty on deep networks have been recently proposed, they do not readily translate to large and complex datasets. In this paper we utilize a simplified form of the Mixture Density Networks (MDNs) to produce a one-shot approach to quantify uncertainty in regression problems. We show that our uncertainty bounds are on-par or better than other reported existing methods. When applied to standard regression benchmark datasets, we show an improvement in predictive log-likelihood and root-mean-square-error when compared to existing state-of-the-art methods. We also demonstrate this method's efficacy on stochastic, highly volatile time-series data where stock prices are predicted for the next time interval. The resulting uncertainty graph summarizes significant anomalies in the stock price chart. Furthermore, we apply this method to the task of age estimation from the challenging IMDb-Wiki dataset of half a million face images. We successfully predict the uncertainties associated with the prediction and empirically analyze the underlying causes of the uncertainties. This uncertainty quantification can be used to pre-process low quality datasets and further enable learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes