CRCYSep 6, 2020

Randomness Concerns When Deploying Differential Privacy

arXiv:2009.03777v128 citations
Originality Synthesis-oriented
AI Analysis

This work tackles a critical infrastructure problem for government agencies deploying privacy-preserving systems at scale, though it is incremental in building on existing randomness methods.

The paper addresses the challenge of generating massive amounts of random bytes (roughly 90TB) required for deploying differential privacy in the 2020 U.S. Census, and proposes a hybrid solution using AES-CTR-DRBG seeded by mixing /dev/urandom and Intel RDSEED to ensure secure randomness.

The U.S. Census Bureau is using differential privacy (DP) to protect confidential respondent data collected for the 2020 Decennial Census of Population & Housing. The Census Bureau's DP system is implemented in the Disclosure Avoidance System (DAS) and requires a source of random numbers. We estimate that the 2020 Census will require roughly 90TB of random bytes to protect the person and household tables. Although there are critical differences between cryptography and DP, they have similar requirements for randomness. We review the history of random number generation on deterministic computers, including von Neumann's "middle-square" method, Mersenne Twister (MT19937) (previously the default NumPy random number generator, which we conclude is unacceptable for use in production privacy-preserving systems), and the Linux /dev/urandom device. We also review hardware random number generator schemes, including the use of so-called "Lava Lamps" and the Intel Secure Key RDRAND instruction. We finally present our plan for generating random bits in the Amazon Web Services (AWS) environment using AES-CTR-DRBG seeded by mixing bits from /dev/urandom and the Intel Secure Key RDSEED instruction, a compromise of our desire to rely on a trusted hardware implementation, the unease of our external reviewers in trusting a hardware-only implementation, and the need to generate so many random bits.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes