STMLAug 4, 2013

Risk-consistency of cross-validation with lasso-type procedures

arXiv:1308.0810v324 citations
Originality Incremental advance
AI Analysis

This provides theoretical justification for a common practice in high-dimensional statistics, addressing a gap for researchers and practitioners using lasso variants, though it is incremental as it builds on existing assumptions and methods.

The paper tackles the lack of theoretical guarantees for cross-validation in selecting tuning parameters for lasso-type methods in high-dimensional settings, showing that it achieves oracle prediction rates up to a log factor under typical assumptions and remains risk-consistent even when the true model is non-linear.

The lasso and related sparsity inducing algorithms have been the target of substantial theoretical and applied research. Correspondingly, many results are known about their behavior for a fixed or optimally chosen tuning parameter specified up to unknown constants. In practice, however, this oracle tuning parameter is inaccessible so one must use the data to select one. Common statistical practice is to use a variant of cross-validation for this task. However, little is known about the theoretical properties of the resulting predictions with such data-dependent methods. We consider the high-dimensional setting with random design wherein the number of predictors $p$ grows with the number of observations $n$. Under typical assumptions on the data generating process, similar to those in the literature, we recover oracle rates up to a log factor when choosing the tuning parameter with cross-validation. Under weaker conditions, when the true model is not necessarily linear, we show that the lasso remains risk consistent relative to its linear oracle. We also generalize these results to the group lasso and square-root lasso and investigate the predictive and model selection performance of cross-validation via simulation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes