STLGAPMEMLApr 26, 2017

A Flexible Framework for Hypothesis Testing in High-dimensions

arXiv:1704.07971v430 citations
Originality Incremental advance
AI Analysis

This provides a general method for statisticians and data scientists working with high-dimensional data, though it is incremental as it builds on existing sparse regression assumptions.

The authors tackled the problem of hypothesis testing in high-dimensional linear regression where parameters exceed samples, by developing a flexible framework that controls type I error and achieves high power, with confidence intervals for linear functionals shown to be minimax rate optimal.

Hypothesis testing in the linear regression model is a fundamental statistical problem. We consider linear regression in the high-dimensional regime where the number of parameters exceeds the number of samples ($p> n$). In order to make informative inference, we assume that the model is approximately sparse, that is the effect of covariates on the response can be well approximated by conditioning on a relatively small number of covariates whose identities are unknown. We develop a framework for testing very general hypotheses regarding the model parameters. Our framework encompasses testing whether the parameter lies in a convex cone, testing the signal strength, and testing arbitrary functionals of the parameter. We show that the proposed procedure controls the type I error, and also analyze the power of the procedure. Our numerical experiments confirm our theoretical findings and demonstrate that we control false positive rate (type I error) near the nominal level, and have high power. By duality between hypotheses testing and confidence intervals, the proposed framework can be used to obtain valid confidence intervals for various functionals of the model parameters. For linear functionals, the length of confidence intervals is shown to be minimax rate optimal.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes