OCLGSYSep 29, 2020

Distributed Online Linear Quadratic Control for Linear Time-invariant Systems

arXiv:2009.13749v18 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of distributed control in multi-agent networks for applications like robotics or power systems, but it is incremental as it extends existing online LQ methods to a distributed setting.

The paper tackles the distributed online linear quadratic control problem for identical linear time-invariant systems with time-varying costs, achieving a regret bound that scales as the square root of the time horizon, which implies consensus among agents over time.

Classical linear quadratic (LQ) control centers around linear time-invariant (LTI) systems, where the control-state pairs introduce a quadratic cost with time-invariant parameters. Recent advancement in online optimization and control has provided novel tools to study LQ problems that are robust to time-varying cost parameters. Inspired by this line of research, we study the distributed online LQ problem for identical LTI systems. Consider a multi-agent network where each agent is modeled as an LTI system. The LTI systems are associated with decoupled, time-varying quadratic costs that are revealed sequentially. The goal of the network is to make the control sequence of all agents competitive to that of the best centralized policy in hindsight, captured by the notion of regret. We develop a distributed variant of the online LQ algorithm, which runs distributed online gradient descent with a projection to a semi-definite programming (SDP) to generate controllers. We establish a regret bound scaling as the square root of the finite time-horizon, implying that agents reach consensus as time grows. We further provide numerical experiments verifying our theoretical result.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes