DCCRLGFeb 16, 2020

Distributed Sketching Methods for Privacy Preserving Regression

arXiv:2002.06538v212 citations
Originality Incremental advance
AI Analysis

This work addresses privacy and efficiency challenges in distributed machine learning, though it appears incremental as it builds on classical sketching methods with novel guarantees.

The authors tackled large-scale regression problems by developing distributed sketching methods that reduce dimensionality while preserving privacy and improving straggler resilience in asynchronous systems, achieving computational efficiency through a hybrid approach validated in serverless computing experiments.

In this work, we study distributed sketching methods for large scale regression problems. We leverage multiple randomized sketches for reducing the problem dimensions as well as preserving privacy and improving straggler resilience in asynchronous distributed systems. We derive novel approximation guarantees for classical sketching methods and analyze the accuracy of parameter averaging for distributed sketches. We consider random matrices including Gaussian, randomized Hadamard, uniform sampling and leverage score sampling in the distributed setting. Moreover, we propose a hybrid approach combining sampling and fast random projections for better computational efficiency. We illustrate the performance of distributed sketches in a serverless computing platform with large scale experiments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes