LGCRJan 27, 2023

Practical Differentially Private Hyperparameter Tuning with Subsampling

arXiv:2301.11989v326 citationsh-index: 21
Originality Incremental advance
AI Analysis

This work addresses privacy and efficiency challenges in DP machine learning for practitioners, though it is incremental as it builds on existing tuning algorithms.

The paper tackles the problem of high privacy cost and computational overhead in differentially private hyperparameter tuning by using random subsets of data and extrapolation, achieving a better privacy-utility trade-off than baseline methods.

Tuning the hyperparameters of differentially private (DP) machine learning (ML) algorithms often requires use of sensitive data and this may leak private information via hyperparameter values. Recently, Papernot and Steinke (2022) proposed a certain class of DP hyperparameter tuning algorithms, where the number of random search samples is randomized itself. Commonly, these algorithms still considerably increase the DP privacy parameter $\varepsilon$ over non-tuned DP ML model training and can be computationally heavy as evaluating each hyperparameter candidate requires a new training run. We focus on lowering both the DP bounds and the computational cost of these methods by using only a random subset of the sensitive data for the hyperparameter tuning and by extrapolating the optimal values to a larger dataset. We provide a Rényi differential privacy analysis for the proposed method and experimentally show that it consistently leads to better privacy-utility trade-off than the baseline method by Papernot and Steinke.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes