OCLGSYJan 7, 2022

Stochastic Saddle Point Problems with Decision-Dependent Distributions

arXiv:2201.02313v328 citations
Originality Incremental advance
AI Analysis

This addresses optimization challenges in dynamic environments like reinforcement learning, but it is incremental as it builds on existing saddle point methods with distributional shifts.

The paper tackles stochastic saddle point problems where the data distribution shifts with decisions, introducing equilibrium points as fixed points of repeated retraining and providing conditions for their existence and uniqueness. It develops primal-dual algorithms with convergence guarantees and error bounds, and proposes a condition to ensure strong convexity-concavity for tractability.

This paper focuses on stochastic saddle point problems with decision-dependent distributions. These are problems whose objective is the expected value of a stochastic payoff function and whose data distribution drifts in response to decision variables--a phenomenon represented by a distributional map. A common approach to accommodating distributional shift is to retrain optimal decisions once a new distribution is revealed, or repeated retraining. We introduce the notion of equilibrium points, which are the fixed points of this repeated retraining procedure, and provide sufficient conditions for their existence and uniqueness. To find equilibrium points, we develop deterministic and stochastic primal-dual algorithms and demonstrate their convergence with constant step-size in the former and polynomial decay step-size schedule in the latter. By modeling errors emerging from a stochastic gradient estimator as sub-Weibull random variables, we provide error bounds in expectation and in high probability that hold for each iteration. Without additional knowledge of the distributional map, computing saddle points is intractable. Thus we propose a condition on the distributional map--which we call opposing mixture dominance--that ensures that the objective is strongly-convex-strongly-concave. Finally, we demonstrate that derivative-free algorithms with a single function evaluation are capable of approximating saddle points

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes