LG AI MLFeb 20, 2024

Investigating the Histogram Loss in Regression

Ehsan Imani, Kai Luedemann, Sam Scholnick-Hughes, Esraa Elelimy, Martha White

arXiv:2402.13425v213.410 citationsh-index: 22Has Code

Originality Synthesis-oriented

AI Analysis

This addresses the problem of understanding performance gains in distribution-based regression for machine learning practitioners, though it is incremental as it analyzes an existing method.

The paper investigates the Histogram Loss in regression, which models the entire distribution of the target variable, and finds that performance gains arise from optimization improvements rather than extra information modeling, demonstrating its viability in deep learning applications without costly tuning.

It is becoming increasingly common in regression to train neural networks that model the entire distribution even if only the mean is required for prediction. This additional modeling often comes with performance gain and the reasons behind the improvement are not fully known. This paper investigates a recent approach to regression, the Histogram Loss, which involves learning the conditional distribution of the target variable by minimizing the cross-entropy between a target distribution and a flexible histogram prediction. We design theoretical and empirical analyses to determine why and when this performance gain appears, and how different components of the loss contribute to it. Our results suggest that the benefits of learning distributions in this setup come from improvements in optimization rather than modelling extra information. We then demonstrate the viability of the Histogram Loss in common deep learning applications without a need for costly hyperparameter tuning.

View on arXiv PDF Code

Similar