LGJul 30, 2024

Improved Bounds for Pure Private Agnostic Learning: Item-Level and User-Level Privacy

arXiv:2407.20640v21 citationsh-index: 4
AI Analysis

This work addresses privacy protection in machine learning for scenarios involving sensitive data, offering incremental improvements in theoretical bounds for agnostic learning.

The paper tackles the problem of private agnostic learning by deriving improved upper bounds on the number of users required under item-level and user-level privacy, achieving near-optimal bounds for general concept classes and tighter results than prior work.

Machine Learning has made remarkable progress in a wide range of fields. In many scenarios, learning is performed on datasets involving sensitive information, in which privacy protection is essential for learning algorithms. In this work, we study pure private learning in the agnostic model -- a framework reflecting the learning process in practice. We examine the number of users required under item-level (where each user contributes one example) and user-level (where each user contributes multiple examples) privacy and derive several improved upper bounds. For item-level privacy, our algorithm achieves a near optimal bound for general concept classes. We extend this to the user-level setting, rendering a tighter upper bound than the one proved by Ghazi et al. (2023). Lastly, we consider the problem of learning thresholds under user-level privacy and present an algorithm with a nearly tight user complexity.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes