SY LGNov 14, 2022

Follow the Clairvoyant: an Imitation Learning Approach to Optimal Control

Andrea Martin, Luca Furieri, Florian Dörfler, John Lygeros, Giancarlo Ferrari-Trecate

arXiv:2211.07389v11.25 citationsh-index: 79Has Code

Originality Highly original

AI Analysis

This work addresses control system design for scenarios with constraints, offering improved performance over existing methods.

The paper tackles the problem of controlling dynamical systems by proposing a method to minimize tracking error relative to optimal trajectories in hindsight, imitating a clairvoyant policy, and shows that it outperforms regret minimization methods in constrained scenarios.

We consider control of dynamical systems through the lens of competitive analysis. Most prior work in this area focuses on minimizing regret, that is, the loss relative to an ideal clairvoyant policy that has noncausal access to past, present, and future disturbances. Motivated by the observation that the optimal cost only provides coarse information about the ideal closed-loop behavior, we instead propose directly minimizing the tracking error relative to the optimal trajectories in hindsight, i.e., imitating the clairvoyant policy. By embracing a system level perspective, we present an efficient optimization-based approach for computing follow-the-clairvoyant (FTC) safe controllers. We prove that these attain minimal regret if no constraints are imposed on the noncausal benchmark. In addition, we present numerical experiments to show that our policy retains the hallmark of competitive algorithms of interpolating between classical $\mathcal{H}_2$ and $\mathcal{H}_\infty$ control laws - while consistently outperforming regret minimization methods in constrained scenarios thanks to the superior ability to chase the clairvoyant.

View on arXiv PDF Code

Similar