MLLGApr 2, 2022

Distributional Gradient Boosting Machines

arXiv:2204.00778v113 citationsh-index: 50
Originality Highly original
AI Analysis

This work enhances tree-based gradient boosting for regression by enabling probabilistic forecasts, which is useful for applications requiring prediction intervals and quantiles.

The paper tackles the problem of predicting the entire conditional distribution in regression tasks, introducing a unified probabilistic gradient boosting framework that models all conditional moments or approximates the cumulative distribution function, achieving state-of-the-art forecast accuracy.

We present a unified probabilistic gradient boosting framework for regression tasks that models and predicts the entire conditional distribution of a univariate response variable as a function of covariates. Our likelihood-based approach allows us to either model all conditional moments of a parametric distribution, or to approximate the conditional cumulative distribution function via Normalizing Flows. As underlying computational backbones, our framework is based on XGBoost and LightGBM. Modelling and predicting the entire conditional distribution greatly enhances existing tree-based gradient boosting implementations, as it allows to create probabilistic forecasts from which prediction intervals and quantiles of interest can be derived. Empirical results show that our framework achieves state-of-the-art forecast accuracy.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes