DC CR LGFeb 16, 2020

Distributed Sketching Methods for Privacy Preserving Regression

arXiv:2002.06538v29.212 citationsh-index: 14

Originality Incremental advance

AI Analysis

This work addresses privacy and efficiency challenges in distributed machine learning, though it appears incremental as it builds on classical sketching methods with novel guarantees.

The authors tackled large-scale regression problems by developing distributed sketching methods that reduce dimensionality while preserving privacy and improving straggler resilience in asynchronous systems, achieving computational efficiency through a hybrid approach validated in serverless computing experiments.

In this work, we study distributed sketching methods for large scale regression problems. We leverage multiple randomized sketches for reducing the problem dimensions as well as preserving privacy and improving straggler resilience in asynchronous distributed systems. We derive novel approximation guarantees for classical sketching methods and analyze the accuracy of parameter averaging for distributed sketches. We consider random matrices including Gaussian, randomized Hadamard, uniform sampling and leverage score sampling in the distributed setting. Moreover, we propose a hybrid approach combining sampling and fast random projections for better computational efficiency. We illustrate the performance of distributed sketches in a serverless computing platform with large scale experiments.

View on arXiv PDF

Similar