LG CR CV DCSep 24, 2021

NanoBatch Privacy: Enabling fast Differentially Private learning on the IPU

Edward H. Lee, Mario Michael Krell, Alexander Tsyplikhin, Victoria Rege, Errol Colak, Kristen W. Yeom

arXiv:2109.12191v23.11 citations

Originality Synthesis-oriented

AI Analysis

This work addresses the problem of slow private training for machine learning practitioners, offering a domain-specific optimization that is incremental in improving efficiency.

The paper tackles the computational overhead of Differentially Private SGD (DPSGD) in deep learning by proposing NanoBatch Privacy, a lightweight add-on to TensorFlow Privacy that uses batch size of 1 and gradient accumulation on Graphcore IPUs, achieving over 15x speedup on ImageNet compared to existing methods.

Differentially private SGD (DPSGD) has recently shown promise in deep learning. However, compared to non-private SGD, the DPSGD algorithm places computational overheads that can undo the benefit of batching in GPUs. Micro-batching is a common method to alleviate this and is fully supported in the TensorFlow Privacy library (TFDP). However, it degrades accuracy. We propose NanoBatch Privacy, a lightweight add-on to TFDP to be used on Graphcore IPUs by leveraging batch size of 1 (without microbatching) and gradient accumulation. This allows us to achieve large total batch sizes with minimal impacts to throughput. Second, we illustrate using Cifar-10 how larger batch sizes are not necessarily optimal from a privacy versus utility perspective. On ImageNet, we achieve more than 15x speedup over TFDP versus 8x A100s and significant speedups even across libraries such as Opacus. We also provide two extensions: 1) DPSGD for pipelined models and 2) per-layer clipping that is 15x faster than the Opacus implementation on 8x A100s. Finally as an application case study, we apply NanoBatch training for use on private Covid-19 chest CT prediction.

View on arXiv PDF

Similar