ML LG OCOct 24, 2015

Fast and Scalable Lasso via Stochastic Frank-Wolfe Methods with a Convergence Guarantee

Emanuele Frandi, Ricardo Nanculef, Stefano Lodi, Claudio Sartori, Johan A. K. Suykens

arXiv:1510.07169v18.622 citationsHas Code

Originality Incremental advance

AI Analysis

This provides a faster and scalable solution for Lasso regression in high-dimensional machine learning applications, though it is incremental as it builds on existing Frank-Wolfe methods.

The authors tackled the problem of solving large-scale Lasso regression efficiently by developing a stochastic Frank-Wolfe method, achieving results such as generating the regularization path for up to four million variables in under a minute and outperforming state-of-the-art solvers like Coordinate Descent.

Frank-Wolfe (FW) algorithms have been often proposed over the last few years as efficient solvers for a variety of optimization problems arising in the field of Machine Learning. The ability to work with cheap projection-free iterations and the incremental nature of the method make FW a very effective choice for many large-scale problems where computing a sparse model is desirable. In this paper, we present a high-performance implementation of the FW method tailored to solve large-scale Lasso regression problems, based on a randomized iteration, and prove that the convergence guarantees of the standard FW method are preserved in the stochastic setting. We show experimentally that our algorithm outperforms several existing state of the art methods, including the Coordinate Descent algorithm by Friedman et al. (one of the fastest known Lasso solvers), on several benchmark datasets with a very large number of features, without sacrificing the accuracy of the model. Our results illustrate that the algorithm is able to generate the complete regularization path on problems of size up to four million variables in less than one minute.

View on arXiv PDF Code

Similar