LGMLFeb 17, 2019

Learning Linear-Quadratic Regulators Efficiently with only $\sqrt{T}$ Regret

arXiv:1902.06223v2185 citations
AI Analysis

This solves a long-standing open problem in control theory for researchers and practitioners, providing an efficient method with optimal regret bounds.

The authors tackled the problem of learning in Linear Quadratic Control systems with unknown dynamics, achieving the first computationally-efficient algorithm with $\widetilde O(\sqrt{T})$ regret, which resolves an open question from prior work.

We present the first computationally-efficient algorithm with $\widetilde O(\sqrt{T})$ regret for learning in Linear Quadratic Control systems with unknown dynamics. By that, we resolve an open question of Abbasi-Yadkori and Szepesvári (2011) and Dean, Mania, Matni, Recht, and Tu (2018).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes