OC LGFeb 19, 2025

Population Dynamics Control with Partial Observations

arXiv:2502.14079v19.42 citationsh-index: 2

Originality Incremental advance

AI Analysis

This work solves a control problem for systems with limited state information, but it is incremental as it extends prior fully observable methods to the partially observable case.

The paper tackles controlling population dynamics under partial observations by addressing challenges like constructing oblivious signals under simplex constraints and designing convex controllers, achieving optimal $ ilde{O}(\sqrt{T})$ regret compared to mixing linear dynamic controllers.

We study the problem of controlling population dynamics, a class of linear dynamical systems evolving on the probability simplex, from the perspective of online non-stochastic control. While Golowich et.al. 2024 analyzed the fully observable setting, we focus on the more realistic, partially observable case, where only a low-dimensional representation of the state is accessible. In classical non-stochastic control, inputs are set as linear combinations of past disturbances. However, under partial observations, disturbances cannot be directly computed. To address this, Simchowitz et.al. 2020 proposed to construct oblivious signals, which are counterfactual observations with zero control, as a substitute. This raises several challenges in our setting: (1) how to construct oblivious signals under simplex constraints, where zero control is infeasible; (2) how to design a sufficiently expressive convex controller parameterization tailored to these signals; and (3) how to enforce the simplex constraint on control when projections may break the convexity of cost functions. Our main contribution is a new controller that achieves the optimal $\tilde{O}(\sqrt{T})$ regret with respect to a natural class of mixing linear dynamic controllers. To tackle these challenges, we construct signals based on hypothetical observations under a constant control adapted to the simplex domain, and introduce a new controller parameterization that approximates general control policies linear in non-oblivious observations. Furthermore, we employ a novel convex extension surrogate loss, inspired by Lattimore 2024, to bypass the projection-induced convexity issue.

View on arXiv PDF

Similar