MLLGNov 11, 2022

Online Linearized LASSO

arXiv:2211.06039v27 citationsh-index: 14
Originality Highly original
AI Analysis

This work addresses the need for efficient online variable selection in streaming data analysis, offering a novel method with theoretical guarantees.

The paper tackles the problem of sparse regression for streaming data by proposing an online sparse linear regression framework, achieving an optimal statistical error rate of $ ilde{O}(\sqrt{s/t})$ with reduced memory and assumptions.

Sparse regression has been a popular approach to perform variable selection and enhance the prediction accuracy and interpretability of the resulting statistical model. Existing approaches focus on offline regularized regression, while the online scenario has rarely been studied. In this paper, we propose a novel online sparse linear regression framework for analyzing streaming data when data points arrive sequentially. Our proposed method is memory efficient and requires less stringent restricted strong convexity assumptions. Theoretically, we show that with a properly chosen regularization parameter, the $\ell_2$-norm statistical error of our estimator diminishes to zero in the optimal order of $\tilde{O}({\sqrt{s/t}})$, where $s$ is the sparsity level, $t$ is the streaming sample size, and $\tilde{O}(\cdot)$ hides logarithmic terms. Numerical experiments demonstrate the practical efficiency of our algorithm.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes