LGCCCRDSMLNov 24, 2020

InstaHide's Sample Complexity When Mixing Two Private Images

arXiv:2011.11877v29 citations
AI Analysis

This work addresses the security of InstaHide for deep learning researchers and practitioners concerned with data privacy, showing its information-theoretic insecurity but computational security in specific scenarios.

This paper investigates the security of InstaHide, a privacy-preserving scheme for neural network training data. The authors propose a new algorithm that can recover all private images from InstaHide mixtures of two images with provable guarantees and optimal sample complexity, demonstrating that InstaHide is not information-theoretically secure.

Training neural networks usually require large numbers of sensitive training data, and how to protect the privacy of training data has thus become a critical topic in deep learning research. InstaHide is a state-of-the-art scheme to protect training data privacy with only minor effects on test accuracy, and its security has become a salient question. In this paper, we systematically study recent attacks on InstaHide and present a unified framework to understand and analyze these attacks. We find that existing attacks either do not have a provable guarantee or can only recover a single private image. On the current InstaHide challenge setup, where each InstaHide image is a mixture of two private images, we present a new algorithm to recover all the private images with a provable guarantee and optimal sample complexity. In addition, we also provide a computational hardness result on retrieving all InstaHide images. Our results demonstrate that InstaHide is not information-theoretically secure but computationally secure in the worst case, even when mixing two private images.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes