STLGMLNov 24, 2019

Histogram Transform Ensembles for Density Estimation

arXiv:1911.11581v1
Originality Incremental advance
AI Analysis

This provides a theoretically grounded ensemble method for density estimation, with potential applications in statistics and machine learning, though it appears incremental as an extension of histogram transforms.

The paper tackles density estimation by proposing histogram transform ensembles (HTE), which achieve universal consistency and almost optimal convergence rates in Hölder spaces, with experiments showing HTE surpasses single transforms and adaptive HTE outperforms state-of-the-art methods on real data.

We investigate an algorithm named histogram transform ensembles (HTE) density estimator whose effectiveness is supported by both solid theoretical analysis and significant experimental performance. On the theoretical side, by decomposing the error term into approximation error and estimation error, we are able to conduct the following analysis: First of all, we establish the universal consistency under $L_1(μ)$-norm. Secondly, under the assumption that the underlying density function resides in the Hölder space $C^{0,α}$, we prove almost optimal convergence rates for both single and ensemble density estimators under $L_1(μ)$-norm and $L_{\infty}(μ)$-norm for different tail distributions, whereas in contrast, for its subspace $C^{1,α}$ consisting of smoother functions, almost optimal convergence rates can only be established for the ensembles and the lower bound of the single estimators illustrates the benefits of ensembles over single density estimators. In the experiments, we first carry out simulations to illustrate that histogram transform ensembles surpass single histogram transforms, which offers powerful evidence to support the theoretical results in the space $C^{1,α}$. Moreover, to further exert the experimental performances, we propose an adaptive version of HTE and study the parameters by generating several synthetic datasets with diversities in dimensions and distributions. Last but not least, real data experiments with other state-of-the-art density estimators demonstrate the accuracy of the adaptive HTE algorithm.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes