LGCVApr 15, 2021

See through Gradients: Image Batch Recovery via GradInversion

arXiv:2104.07586v1601 citations
Originality Highly original
AI Analysis

This work reveals a significant privacy vulnerability in collaborative and federated learning settings, showing that gradient averaging does not fully protect data, which is a foundational issue for secure machine learning.

The authors tackled the problem of recovering original input images from averaged gradients in deep neural network training, which was previously thought to be safe for privacy, and demonstrated that their GradInversion method can recover images from batches of 8-48 images with high fidelity on complex datasets like ImageNet using ResNets.

Training deep neural networks requires gradient estimation from data batches to update parameters. Gradients per parameter are averaged over a set of data and this has been presumed to be safe for privacy-preserving training in joint, collaborative, and federated learning applications. Prior work only showed the possibility of recovering input data given gradients under very restrictive conditions - a single input point, or a network with no non-linearities, or a small 32x32 px input batch. Therefore, averaging gradients over larger batches was thought to be safe. In this work, we introduce GradInversion, using which input images from a larger batch (8 - 48 images) can also be recovered for large networks such as ResNets (50 layers), on complex datasets such as ImageNet (1000 classes, 224x224 px). We formulate an optimization task that converts random noise into natural images, matching gradients while regularizing image fidelity. We also propose an algorithm for target class label recovery given gradients. We further propose a group consistency regularization framework, where multiple agents starting from different random seeds work together to find an enhanced reconstruction of original data batch. We show that gradients encode a surprisingly large amount of information, such that all the individual images can be recovered with high fidelity via GradInversion, even for complex datasets, deep networks, and large batch sizes.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes