LGCRDCNov 22, 2019

Parallel Distributed Logistic Regression for Vertical Federated Learning without Third-Party Coordinator

arXiv:1911.09824v1144 citations
Originality Incremental advance
AI Analysis

This work addresses efficiency and privacy concerns in federated learning for scenarios where data is partitioned by features across different parties, though it appears incremental as it builds on existing parameter server methods.

The paper tackles the problem of training logistic regression models in vertical federated learning by removing the need for a third-party coordinator, using a parameter server architecture to speed up training on large datasets, with experimental results demonstrating great scalability.

Federated Learning is a new distributed learning mechanism which allows model training on a large corpus of decentralized data owned by different data providers, without sharing or leakage of raw data. According to the characteristics of data dis-tribution, it could be usually classified into three categories: horizontal federated learning, vertical federated learning, and federated transfer learning. In this paper we present a solution for parallel dis-tributed logistic regression for vertical federated learning. As compared with existing works, the role of third-party coordinator is removed in our proposed solution. The system is built on the pa-rameter server architecture and aims to speed up the model training via utilizing a cluster of servers in case of large volume of training data. We also evaluate the performance of the parallel distributed model training and the experimental results show the great scalability of the system.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes