CR AI GTJan 22, 2022

Long-term Data Sharing under Exclusivity Attacks

arXiv:2201.09137v110.13 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses a security challenge for companies and institutions using shared data platforms, but it is incremental as it builds on existing protocol analysis.

The paper tackles the problem of data sharing platforms being vulnerable to exclusivity attacks, where a firm shares distorted data to gain a superior model while misleading others, and finds that protocol choice and the number of Sybil identities significantly affect vulnerability in regression and clustering tasks.

The quality of learning generally improves with the scale and diversity of data. Companies and institutions can therefore benefit from building models over shared data. Many cloud and blockchain platforms, as well as government initiatives, are interested in providing this type of service. These cooperative efforts face a challenge, which we call ``exclusivity attacks''. A firm can share distorted data, so that it learns the best model fit, but is also able to mislead others. We study protocols for long-term interactions and their vulnerability to these attacks, in particular for regression and clustering tasks. We conclude that the choice of protocol, as well as the number of Sybil identities an attacker may control, is material to vulnerability.

View on arXiv PDF Code

Similar