STEMMLJun 12, 2020

Detangling robustness in high dimensions: composite versus model-averaged estimation

arXiv:2006.07457v1
Originality Incremental advance
AI Analysis

This work addresses a gap in statistical theory for robust methods in high dimensions, providing a toolbox for practitioners, but it is incremental as it builds on existing equivalence concepts without introducing a new paradigm.

The paper tackles the problem of understanding robust methods in high-dimensional regularized estimation, focusing on prediction, and finds that model-averaged and composite quantile estimators often outperform least-squares methods, even with Gaussian noise, as shown in simulations and a real-world audio signal reconstruction application.

Robust methods, though ubiquitous in practice, are yet to be fully understood in the context of regularized estimation and high dimensions. Even simple questions become challenging very quickly. For example, classical statistical theory identifies equivalence between model-averaged and composite quantile estimation. However, little to nothing is known about such equivalence between methods that encourage sparsity. This paper provides a toolbox to further study robustness in these settings and focuses on prediction. In particular, we study optimally weighted model-averaged as well as composite $l_1$-regularized estimation. Optimal weights are determined by minimizing the asymptotic mean squared error. This approach incorporates the effects of regularization, without the assumption of perfect selection, as is often used in practice. Such weights are then optimal for prediction quality. Through an extensive simulation study, we show that no single method systematically outperforms others. We find, however, that model-averaged and composite quantile estimators often outperform least-squares methods, even in the case of Gaussian model noise. Real data application witnesses the method's practical use through the reconstruction of compressed audio signals.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes