LGGTMLFeb 8, 2023

Algorithmic Collective Action in Machine Learning

arXiv:2302.04262v339 citationsh-index: 54
AI Analysis

This addresses the issue of algorithmic manipulation for users on digital platforms, presenting a novel theoretical and experimental framework that is foundational rather than incremental.

The paper tackles the problem of how collectives can influence machine learning algorithms on digital platforms by coordinating data modifications, showing that even small collectives can significantly control a platform's algorithm, with empirical validation on a skill classification task involving tens of thousands of resumes and over two thousand training runs.

We initiate a principled study of algorithmic collective action on digital platforms that deploy machine learning algorithms. We propose a simple theoretical model of a collective interacting with a firm's learning algorithm. The collective pools the data of participating individuals and executes an algorithmic strategy by instructing participants how to modify their own data to achieve a collective goal. We investigate the consequences of this model in three fundamental learning-theoretic settings: the case of a nonparametric optimal learning algorithm, a parametric risk minimizer, and gradient-based optimization. In each setting, we come up with coordinated algorithmic strategies and characterize natural success criteria as a function of the collective's size. Complementing our theory, we conduct systematic experiments on a skill classification task involving tens of thousands of resumes from a gig platform for freelancers. Through more than two thousand model training runs of a BERT-like language model, we see a striking correspondence emerge between our empirical observations and the predictions made by our theory. Taken together, our theory and experiments broadly support the conclusion that algorithmic collectives of exceedingly small fractional size can exert significant control over a platform's learning algorithm.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes