ML LG OC MEJan 29, 2019

Rank-one Convexification for Sparse Regression

arXiv:1901.10334v215.149 citations

Originality Highly original

AI Analysis

This work addresses the challenge of efficiently solving sparse regression problems for researchers and practitioners in machine learning, offering a novel relaxation method that enhances both computational and statistical performance.

The paper tackles the NP-hard problem of sparse regression with an ℓ0 constraint by deriving new strong convex relaxations based on rank-one quadratic terms, resulting in near-optimal solutions with a 0.4% optimality gap and improved prediction accuracy compared to alternative convex approaches.

Sparse regression models are increasingly prevalent due to their ease of interpretability and superior out-of-sample performance. However, the exact model of sparse regression with an $\ell_0$ constraint restricting the support of the estimators is a challenging (\NP-hard) non-convex optimization problem. In this paper, we derive new strong convex relaxations for sparse regression. These relaxations are based on the ideal (convex-hull) formulations for rank-one quadratic terms with indicator variables. The new relaxations can be formulated as semidefinite optimization problems in an extended space and are stronger and more general than the state-of-the-art formulations, including the perspective reformulation and formulations with the reverse Huber penalty and the minimax concave penalty functions. Furthermore, the proposed rank-one strengthening can be interpreted as a \textit{non-separable, non-convex, unbiased} sparsity-inducing regularizer, which dynamically adjusts its penalty according to the shape of the error function without inducing bias for the sparse solutions. In our computational experiments with benchmark datasets, the proposed conic formulations are solved within seconds and result in near-optimal solutions (with 0.4\% optimality gap) for non-convex $\ell_0$-problems. Moreover, the resulting estimators also outperform alternative convex approaches from a statistical perspective, achieving high prediction accuracy and good interpretability.

View on arXiv PDF

Similar