LGMLFeb 27, 2019

Provable Approximations for Constrained $\ell_p$ Regression

arXiv:1902.10407v12 citationsHas Code
Originality Highly original
AI Analysis

This addresses a fundamental optimization problem in machine learning for researchers and practitioners, offering a direct solution with theoretical guarantees, though it is incremental in improving upon existing approximations.

The paper tackles the constrained ℓp regression problem, which is non-convex and typically avoided via ridge regression, by providing the first provable constant factor approximation algorithm that solves it directly for constant p and d, with an O(n log n) running time and extensions for streaming and distributed data.

The $\ell_p$ linear regression problem is to minimize $f(x)=||Ax-b||_p$ over $x\in\mathbb{R}^d$, where $A\in\mathbb{R}^{n\times d}$, $b\in \mathbb{R}^n$, and $p>0$. To avoid overfitting and bound $||x||_2$, the constrained $\ell_p$ regression minimizes $f(x)$ over every unit vector $x\in\mathbb{R}^d$. This makes the problem non-convex even for the simplest case $d=p=2$. Instead, ridge regression is used to minimize the Lagrange form $f(x)+λ||x||_2$ over $x\in\mathbb{R}^d$, which yields a convex problem in the price of calibrating the regularization parameter $λ>0$. We provide the first provable constant factor approximation algorithm that solves the constrained $\ell_p$ regression directly, for every constant $p,d\geq 1$. Using core-sets, its running time is $O(n \log n)$ including extensions for streaming and distributed (big) data. In polynomial time, it can handle outliers, $p\in (0,1)$ and minimize $f(x)$ over every $x$ and permutation of rows in $A$. Experimental results are also provided, including open source and comparison to existing software.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes