LGCLCRJul 14, 2023

Population Expansion for Training Language Models with Private Federated Learning

arXiv:2307.07477v13 citationsh-index: 43
Originality Incremental advance
AI Analysis

This work addresses efficiency and performance issues for applications with limited device availability in privacy-preserving machine learning, representing an incremental improvement.

The paper tackles the problem of degraded model utility and increased training latency in private federated learning with small populations by proposing population expansion using domain adaptation techniques, resulting in a 13% to 30% improvement in utility on real-world language modeling datasets.

Federated learning (FL) combined with differential privacy (DP) offers machine learning (ML) training with distributed devices and with a formal privacy guarantee. With a large population of devices, FL with DP produces a performant model in a timely manner. However, for applications with a smaller population, not only does the model utility degrade as the DP noise is inversely proportional to population, but also the training latency increases since waiting for enough clients to become available from a smaller pool is slower. In this work, we thus propose expanding the population based on domain adaptation techniques to speed up the training and improves the final model quality when training with small populations. We empirically demonstrate that our techniques can improve the utility by 13% to 30% on real-world language modeling datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes