LGEMSTMLJan 11, 2019

Non-Parametric Inference Adaptive to Intrinsic Dimension

arXiv:1901.03719v39 citations
Originality Incremental advance
AI Analysis

This addresses high-dimensional statistical inference problems for researchers, providing adaptive methods with theoretical guarantees, but it is incremental as it builds on existing k-NN and sub-sampling techniques.

The paper tackles non-parametric estimation and inference in high-dimensional conditional moment models where the covariate dimension exceeds sample size, showing that estimation is feasible if the intrinsic dimension is small, with finite sample error of order n^{-1/(d+2)} and asymptotic normality irrespective of the ambient dimension.

We consider non-parametric estimation and inference of conditional moment models in high dimensions. We show that even when the dimension $D$ of the conditioning variable is larger than the sample size $n$, estimation and inference is feasible as long as the distribution of the conditioning variable has small intrinsic dimension $d$, as measured by locally low doubling measures. Our estimation is based on a sub-sampled ensemble of the $k$-nearest neighbors ($k$-NN) $Z$-estimator. We show that if the intrinsic dimension of the covariate distribution is equal to $d$, then the finite sample estimation error of our estimator is of order $n^{-1/(d+2)}$ and our estimate is $n^{1/(d+2)}$-asymptotically normal, irrespective of $D$. The sub-sampling size required for achieving these results depends on the unknown intrinsic dimension $d$. We propose an adaptive data-driven approach for choosing this parameter and prove that it achieves the desired rates. We discuss extensions and applications to heterogeneous treatment effect estimation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes