LGCRGTJan 30, 2023

The Fair Value of Data Under Heterogeneous Privacy Constraints in Federated Learning

arXiv:2301.13336v26 citationsh-index: 89
Originality Incremental advance
AI Analysis

This addresses the challenge of designing equitable data-sharing incentives for users with varying privacy preferences in federated learning, offering a novel fairness framework but with incremental impact on practical implementations.

The paper tackles the problem of fairly compensating users for their data in federated learning under heterogeneous privacy constraints, proposing an axiomatic fairness concept based on the Shapley value and analyzing how platforms set incentives based on privacy sensitivity, data amounts, and heterogeneity.

Modern data aggregation often involves a platform collecting data from a network of users with various privacy options. Platforms must solve the problem of how to allocate incentives to users to convince them to share their data. This paper puts forth an idea for a \textit{fair} amount to compensate users for their data at a given privacy level based on an axiomatic definition of fairness, along the lines of the celebrated Shapley value. To the best of our knowledge, these are the first fairness concepts for data that explicitly consider privacy constraints. We also formulate a heterogeneous federated learning problem for the platform with privacy level options for users. By studying this problem, we investigate the amount of compensation users receive under fair allocations with different privacy levels, amounts of data, and degrees of heterogeneity. We also discuss what happens when the platform is forced to design fair incentives. Under certain conditions we find that when privacy sensitivity is low, the platform will set incentives to ensure that it collects all the data with the lowest privacy options. When the privacy sensitivity is above a given threshold, the platform will provide no incentives to users. Between these two extremes, the platform will set the incentives so some fraction of the users chooses the higher privacy option and the others chooses the lower privacy option.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes