MLLGJul 21, 2019

Noise Regularization for Conditional Density Estimation

arXiv:1907.08982v238 citations
AI Analysis

This addresses overfitting in conditional density estimation for applications requiring full probability modeling, though it is incremental as it adapts existing noise regularization ideas to CDE.

The paper tackles overfitting in neural network-based conditional density estimation (CDE) by proposing a noise regularization method that adds random perturbations to data during training, which significantly outperforms other methods across seven datasets and three models, making neural CDE preferable even with scarce data.

Modelling statistical relationships beyond the conditional mean is crucial in many settings. Conditional density estimation (CDE) aims to learn the full conditional probability density from data. Though highly expressive, neural network based CDE models can suffer from severe over-fitting when trained with the maximum likelihood objective. Due to the inherent structure of such models, classical regularization approaches in the parameter space are rendered ineffective. To address this issue, we develop a model-agnostic noise regularization method for CDE that adds random perturbations to the data during training. We demonstrate that the proposed approach corresponds to a smoothness regularization and prove its asymptotic consistency. In our experiments, noise regularization significantly and consistently outperforms other regularization methods across seven data sets and three CDE models. The effectiveness of noise regularization makes neural network based CDE the preferable method over previous non- and semi-parametric approaches, even when training data is scarce.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes