SYLGMay 29, 2020

On Regularizability and its Application to Online Control of Unstable LTI Systems

arXiv:2006.00125v315 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of controlling unstable systems in real-time for applications like aircraft control, but it appears incremental as it builds on existing control theory with a new synthesis method.

The paper tackles the problem of online regulation of unstable linear systems without needing an initial stabilizing controller or persistently exciting data, by introducing the concept of 'regularizability' and proposing the Data-Guided Regulation (DGR) synthesis procedure. The result includes a computational improvement via a rank-one update and demonstration on an X-29 aircraft, though no concrete numbers are provided.

Learning, say through direct policy updates, often requires assumptions such as knowing a priori that the initial policy (gain) is stabilizing, or persistently exciting (PE) input-output data, is available. In this paper, we examine online regulation of (possibly unstable) partially unknown linear systems with no prior access to an initial stabilizing controller nor PE input-output data; we instead leverage the knowledge of the input matrix for online regulation. First, we introduce and characterize the notion of "regularizability" for linear systems that gauges the extent by which a system can be regulated in finite-time in contrast to its asymptotic behavior (commonly characterized by stabilizability/controllability). Next, having access only to the input matrix, we propose the Data-Guided Regulation (DGR) synthesis procedure that -- as its name suggests -- regulates the underlying state while also generating informative data that can subsequently be used for data-driven stabilization or system identification. We further improve the computational performance of DGR via a rank-one update and demonstrate its utility in online regulation of the X-29 aircraft.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes