LG MLMay 27, 2019

Parallel and Communication Avoiding Least Angle Regression

S. Das, J. Demmel, K. Fountoulakis, L. Grigori, M. W. Mahoney, S. Yang

arXiv:1905.11340v33.43 citations

Originality Incremental advance

AI Analysis

This work addresses the computational bottleneck of LARS for large-scale linear regression, offering incremental improvements in parallel efficiency for data analysis applications.

The authors tackled the problem of parallelizing the Least Angle Regression (LARS) algorithm for high-dimensional data by proposing two communication-avoiding versions, bLARS and T-bLARS, which achieved speedups up to 4x without compromising solution quality.

We are interested in parallelizing the Least Angle Regression (LARS) algorithm for fitting linear regression models to high-dimensional data. We consider two parallel and communication avoiding versions of the basic LARS algorithm. The two algorithms have different asymptotic costs and practical performance. One offers more speedup and the other produces more accurate output. The first is bLARS, a block version of LARS algorithm, where we update b columns at each iteration. Assuming that the data are row-partitioned, bLARS reduces the number of arithmetic operations, latency, and bandwidth by a factor of b. The second is Tournament-bLARS (T-bLARS), a tournament version of LARS where processors compete by running several LARS computations in parallel to choose b new columns to be added in the solution. Assuming that the data are column-partitioned, T-bLARS reduces latency by a factor of b. Similarly to LARS, our proposed methods generate a sequence of linear models. We present extensive numerical experiments that illustrate speedups up to 4x compared to LARS without any compromise in solution quality.

View on arXiv PDF

Similar