OCJan 12, 2012
Global stabilization of nonlinear systems based on vector control lyapunov functionsIasson Karafyllis, Zhong-Ping Jiang
This paper studies the use of vector Lyapunov functions for the design of globally stabilizing feedback laws for nonlinear systems. Recent results on vector Lyapunov functions are utilized. The main result of the paper shows that the existence of a vector control Lyapunov function is a necessary and sufficient condition for the existence of a smooth globally stabilizing feedback. Applications to nonlinear systems are provided: simple and easily checkable sufficient conditions are proposed to guarantee the existence of a smooth globally stabilizing feedback law. The obtained results are applied to the problem of the stabilization of an equilibrium point of a reaction network taking place in a continuous stirred tank reactor.
SYJul 16, 2021
Reinforcement Learning for Adaptive Optimal Stationary Control of Linear Stochastic SystemsBo Pang, Zhong-Ping Jiang
This paper studies the adaptive optimal stationary control of continuous-time linear stochastic systems with both additive and multiplicative noises, using reinforcement learning techniques. Based on policy iteration, a novel off-policy reinforcement learning algorithm, named optimistic least-squares-based policy iteration, is proposed which is able to find iteratively near-optimal policies of the adaptive optimal stationary control problem directly from input/state data without explicitly identifying any system matrices, starting from an initial admissible control policy. The solutions given by the proposed optimistic least-squares-based policy iteration are proved to converge to a small neighborhood of the optimal solution with probability one, under mild conditions. The application of the proposed algorithm to a triple inverted pendulum example validates its feasibility and effectiveness.
OCAug 25, 2020
Robust Reinforcement Learning: A Case Study in Linear Quadratic RegulationBo Pang, Zhong-Ping Jiang
This paper studies the robustness of reinforcement learning algorithms to errors in the learning process. Specifically, we revisit the benchmark problem of discrete-time linear quadratic regulation (LQR) and study the long-standing open question: Under what conditions is the policy iteration method robustly stable from a dynamical systems perspective? Using advanced stability results in control theory, it is shown that policy iteration for LQR is inherently robust to small errors in the learning process and enjoys small-disturbance input-to-state stability: whenever the error in each iteration is bounded and small, the solutions of the policy iteration algorithm are also bounded, and, moreover, enter and stay in a small neighbourhood of the optimal LQR solution. As an application, a novel off-policy optimistic least-squares policy iteration for the LQR problem is proposed, when the system dynamics are subjected to additive stochastic disturbances. The proposed new results in robust reinforcement learning are validated by a numerical example.
LGJun 1, 2020
Temporal-Differential Learning in Continuous EnvironmentsTao Bian, Zhong-Ping Jiang
In this paper, a new reinforcement learning (RL) method known as the method of temporal differential is introduced. Compared to the traditional temporal-difference learning method, it plays a crucial role in developing novel RL techniques for continuous environments. In particular, the continuous-time least squares policy evaluation (CT-LSPE) and the continuous-time temporal-differential (CT-TD) learning methods are developed. Both theoretical and empirical evidences are provided to demonstrate the effectiveness of the proposed temporal-differential learning methodology.
SYJun 29, 2019
Distributed Global Output-Feedback Control for a Class of Euler-Lagrange SystemsQingkai Yang, Hao Fang, Jie Chen et al.
This published paper investigates the distributed tracking control problem for a class of Euler-Lagrange multi-agent systems when the agents can only measure the positions. In this case, the lack of the separation principle and the strong nonlinearity in unmeasurable states pose severe technical challenges to global output-feedback control design. To overcome these difficulties, a global nonsingular coordinate transformation matrix in the upper triangular form is firstly proposed such that the nonlinear dynamic model can be partially linearized with respect to the unmeasurable states. And, a new type of velocity observers is designed to estimate the unmeasurable velocities for each system. Then, based on the outputs of the velocity observers, we propose distributed control laws that enable the coordinated tracking control system to achieve uniform global exponential stability (UGES). Both theoretical analysis and numerical simulations are presented to validate the effectiveness of the proposed control scheme. Followed by the original paper, a typo and a mistake is corrected.