ST MLDec 22, 2014

An $\{l_1,l_2,l_{\infty}\}$-Regularization Approach to High-Dimensional Errors-in-variables Models

Alexandre Belloni, Mathieu Rosenbaum, Alexandre B. Tsybakov

arXiv:1412.7216v110 citations

Originality Incremental advance

AI Analysis

This work addresses high-dimensional errors-in-variables models for statistical inference, but it is incremental as it builds on recent methods by adding regularization.

The paper tackles the problem of linear regression with errors in the design by proposing two new estimators that incorporate an additional l∞-norm regularization, applicable under different assumptions without requiring accurate pilot estimators. It establishes convergence rates and compares them to existing bounds, showing insights into how assumptions affect achievable rates.

Several new estimation methods have been recently proposed for the linear regression model with observation error in the design. Different assumptions on the data generating process have motivated different estimators and analysis. In particular, the literature considered (1) observation errors in the design uniformly bounded by some $\bar δ$, and (2) zero mean independent observation errors. Under the first assumption, the rates of convergence of the proposed estimators depend explicitly on $\bar δ$, while the second assumption has been applied when an estimator for the second moment of the observational error is available. This work proposes and studies two new estimators which, compared to other procedures for regression models with errors in the design, exploit an additional $l_{\infty}$-norm regularization. The first estimator is applicable when both (1) and (2) hold but does not require an estimator for the second moment of the observational error. The second estimator is applicable under (2) and requires an estimator for the second moment of the observation error. Importantly, we impose no assumption on the accuracy of this pilot estimator, in contrast to the previously known procedures. As the recent proposals, we allow the number of covariates to be much larger than the sample size. We establish the rates of convergence of the estimators and compare them with the bounds obtained for related estimators in the literature. These comparisons show interesting insights on the interplay of the assumptions and the achievable rates of convergence.

View on arXiv PDF

Similar