LGMLDec 13, 2021

How to Learn when Data Gradually Reacts to Your Model

arXiv:2112.07042v134 citations
Originality Incremental advance
AI Analysis

This addresses a practical limitation in performative ML for applications where populations adapt slowly, offering an incremental improvement over existing methods.

The paper tackles the problem of training machine learning models when data distributions react gradually to deployed models, rather than immediately, by proposing Stateful Performative Gradient Descent (Stateful PerfGD). The result shows that Stateful PerfGD substantially outperforms previous state-of-the-art methods, with theoretical convergence guarantees.

A recent line of work has focused on training machine learning (ML) models in the performative setting, i.e. when the data distribution reacts to the deployed model. The goal in this setting is to learn a model which both induces a favorable data distribution and performs well on the induced distribution, thereby minimizing the test loss. Previous work on finding an optimal model assumes that the data distribution immediately adapts to the deployed model. In practice, however, this may not be the case, as the population may take time to adapt to the model. In many applications, the data distribution depends on both the currently deployed ML model and on the "state" that the population was in before the model was deployed. In this work, we propose a new algorithm, Stateful Performative Gradient Descent (Stateful PerfGD), for minimizing the performative loss even in the presence of these effects. We provide theoretical guarantees for the convergence of Stateful PerfGD. Our experiments confirm that Stateful PerfGD substantially outperforms previous state-of-the-art methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes