CR, Nov 5, 2020

FederBoost: Private Federated Learning for GBDT

arXiv:2011.02796v4, 85 citations
AI Analysis

FederBoost offers a practical solution for industrial applications that need efficient, private federated learning with GBDT, though its contribution is incremental, chiefly improving speed over existing methods.

The paper tackles the slow, computationally heavy nature of private federated learning for gradient boosting decision trees (GBDT) by proposing FederBoost, a framework that supports both vertically and horizontally partitioned data without heavy cryptography. The reported results show that FederBoost reaches the same level of accuracy as centralized training while running 4-5 orders of magnitude faster than state-of-the-art solutions for federated decision tree training.

Federated Learning (FL) has been an emerging trend in machine learning and artificial intelligence. It allows multiple participants to collaboratively train a better global model and offers a privacy-aware paradigm for model training, since it does not require participants to release their original training data. However, existing FL solutions for vertically partitioned data or decision trees require heavy cryptographic operations. In this paper, we propose a framework named FederBoost for private federated learning of gradient boosting decision trees (GBDT). It supports running GBDT over both vertically and horizontally partitioned data. Vertical FederBoost does not require any cryptographic operation, and horizontal FederBoost requires only lightweight secure aggregation. The key observation is that the whole training process of GBDT relies on the ordering of the data rather than on the values. We fully implement FederBoost and evaluate its utility and efficiency through extensive experiments performed on three public datasets. Our experimental results show that both vertical and horizontal FederBoost achieve the same level of accuracy as centralized training, where all data are collected on a central server, and that they are 4-5 orders of magnitude faster than the state-of-the-art solutions for federated decision tree training, hence offering practical solutions for industrial applications.
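The key observation in the abstract, that GBDT training depends on the ordering of the data rather than on the values, can be illustrated with a small sketch. This is not the FederBoost protocol itself. It is a toy, single-machine split search using the standard second-order gain formula, written under several assumptions: the function names `best_split_from_order` and `ordering`, the regularizer `lam`, the squared-loss gradients, and the sample data are all hypothetical, and ties between feature values are ignored. The split search receives only the ranking of samples along a feature, never the raw values, so a monotone transform of the feature (here `exp`) leaves the chosen partition unchanged.

```python
import math


def best_split_from_order(order, grads, hess, lam=1.0):
    """Find the best split using only the sample ordering along one feature.

    order: sample indices sorted by the feature (raw values are never seen).
    grads, hess: per-sample first and second derivatives of the loss.
    lam: L2 regularizer (hypothetical default, for illustration only).
    Returns (gain, set of sample ids sent to the left child).
    """
    total_g, total_h = sum(grads), sum(hess)
    left_g = left_h = 0.0
    best_gain, best_pos = float("-inf"), None
    # Scan the candidate split positions between consecutive ranked samples.
    for pos in range(len(order) - 1):
        i = order[pos]
        left_g += grads[i]
        left_h += hess[i]
        right_g, right_h = total_g - left_g, total_h - left_h
        gain = (left_g ** 2 / (left_h + lam)
                + right_g ** 2 / (right_h + lam)
                - total_g ** 2 / (total_h + lam))
        if gain > best_gain:
            best_gain, best_pos = gain, pos
    return best_gain, set(order[:best_pos + 1])


def ordering(values):
    """Return sample indices sorted by feature value (ties not handled)."""
    return sorted(range(len(values)), key=lambda i: values[i])


# Toy data (assumed): one feature, binary labels, squared loss, prediction 0.5.
feature = [0.3, 2.5, 1.1, 4.0, 0.9, 3.2]
labels = [0, 1, 0, 1, 0, 1]
grads = [0.5 - y for y in labels]   # gradient of 0.5 * (pred - y)^2 at pred = 0.5
hess = [1.0] * len(labels)          # Hessian of squared loss is constant

gain_raw, left_raw = best_split_from_order(ordering(feature), grads, hess)

# A monotone transform changes every value but preserves the ordering.
transformed = [math.exp(v) for v in feature]
gain_tx, left_tx = best_split_from_order(ordering(transformed), grads, hess)

print(left_raw == left_tx, gain_raw == gain_tx)
```

Because the party that holds a feature could in principle share only such a ranking instead of the raw column, this gives some intuition for why the vertical setting might avoid cryptographic operations. How FederBoost actually shares and protects ordering information is described in the paper and is not reproduced here.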

