OCSYSYApr 10

A Bayesian Perspective on the Data-Driven LQR

arXiv:2604.0907419.8h-index: 2
AI Analysis

This work addresses control design for unknown dynamical systems, offering a principled approach to handle uncertainty, but it is incremental as it builds on existing data-driven LQR methods.

The paper tackles the problem of data-driven linear quadratic control under model uncertainty by proposing a Bayesian formulation that incorporates posterior uncertainty into control design, showing improved optimality gap and closed-loop stability in simulations, especially with limited data.

The data-driven linear quadratic regulator (ddLQR) is a widely studied control method for unknown dynamical systems with disturbance. Existing approaches, both indirect, i.e., those that identify a model followed by model-based design, and direct, which bypasses the identification step, often rely on the certainty-equivalence principle and therefore do not explicitly account for model uncertainty. In this paper, we propose a Bayesian formulation for both indirect and direct ddLQR that incorporates posterior uncertainty into the control design. The resulting expected cost decomposes into a certainty-equivalence term and a variance-dependent term, providing a principled interpretation of regularization. We further show that the indirect and direct formulations are equivalent under this perspective. The resulting direct method admits a tractable semidefinite program whose size is independent of the data length. Numerical simulations demonstrate improved optimality gap and closed-loop stability, particularly in low-data regimes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes