Ryan Self

3papers

39citations

Novelty37%

AI Score20

Ranked #195,324 of 205,806 authors (top 95%)#1,769 in SY (top 86%)

3 Papers

SYNov 21, 2018

Online inverse reinforcement learning for nonlinear systems

Ryan Self, Michael Harlan, Rushikesh Kamalapurkar

This paper focuses on the development of an online inverse reinforcement learning (IRL) technique for a class of nonlinear systems. The developed approach utilizes observed state and input trajectories, and determines the unknown cost function and the unknown value function online. A parameter estimation technique is utilized to allow the developed IRL technique to determine the cost function weights in the presence of unknown dynamics. Simulation results are presented for a nonlinear system showing convergence of both unknown reward function weights and unknown dynamics.

SYJul 10, 2019

Output-feedback online optimal control for a class of nonlinear systems

Ryan Self, Michael Harlan, Rushikesh Kamalapurkar

In this paper an output-feedback model-based reinforcement learning (MBRL) method for a class of second-order nonlinear systems is developed. The control technique uses exact model knowledge and integrates a dynamic state estimator within the model-based reinforcement learning framework to achieve output-feedback MBRL. Simulation results demonstrate the efficacy of the developed method.

SYNov 3, 2020

Online Observer-Based Inverse Reinforcement Learning

Ryan Self, Kevin Coleman, He Bai et al.

In this paper, a novel approach to the output-feedback inverse reinforcement learning (IRL) problem is developed by casting the IRL problem, for linear systems with quadratic cost functions, as a state estimation problem. Two observer-based techniques for IRL are developed, including a novel observer method that re-uses previous state estimates via history stacks. Theoretical guarantees for convergence and robustness are established under appropriate excitation conditions. Simulations demonstrate the performance of the developed observers and filters under noisy and noise-free measurements.