Online Model-Free Reinforcement Learning for the Automatic Control of a Flexible Wing Aircraft
This work addresses the need for robust real-time control mechanisms for flexible wing aircraft, representing an incremental improvement in applying reinforcement learning to a specific domain.
The paper tackled the challenging control problem of flexible wing aircraft, which suffer from high nonlinear deformations, by developing an online model-free reinforcement learning controller that demonstrated superior performance in simulations under various operating conditions.
The control problem of the flexible wing aircraft is challenging due to the prevailing and high nonlinear deformations in the flexible wing system. This urged for new control mechanisms that are robust to the real-time variations in the wing's aerodynamics. An online control mechanism based on a value iteration reinforcement learning process is developed for flexible wing aerial structures. It employs a model-free control policy framework and a guaranteed convergent adaptive learning architecture to solve the system's Bellman optimality equation. A Riccati equation is derived and shown to be equivalent to solving the underlying Bellman equation. The online reinforcement learning solution is implemented using means of an adaptive-critic mechanism. The controller is proven to be asymptotically stable in the Lyapunov sense. It is assessed through computer simulations and its superior performance is demonstrated on two scenarios under different operating conditions.