LG STJan 9, 2023

Distributed Sparse Linear Regression under Communication Constraints

arXiv:2301.04022v23.82 citationsh-index: 43

Originality Incremental advance

AI Analysis

This addresses the problem of efficient distributed learning for applications with limited bandwidth and power, though it is incremental in improving communication efficiency.

The paper tackles distributed sparse linear regression under severe communication constraints by proposing two-round schemes where machines send only a few values, achieving exact support recovery at low signal-to-noise ratios where individual machines fail.

In multiple domains, statistical tasks are performed in distributed settings, with data split among several end machines that are connected to a fusion center. In various applications, the end machines have limited bandwidth and power, and thus a tight communication budget. In this work we focus on distributed learning of a sparse linear regression model, under severe communication constraints. We propose several two round distributed schemes, whose communication per machine is sublinear in the data dimension. In our schemes, individual machines compute debiased lasso estimators, but send to the fusion center only very few values. On the theoretical front, we analyze one of these schemes and prove that with high probability it achieves exact support recovery at low signal to noise ratios, where individual machines fail to recover the support. We show in simulations that our scheme works as well as, and in some cases better, than more communication intensive approaches.

View on arXiv PDF

Similar