NE SYNov 4, 2019

Tustin neural networks: a class of recurrent nets for adaptive MPC of mechanical systems

Simone Pozzoli, Marco Gallieri, Riccardo Scattolini

arXiv:1911.01310v15.52 citations

Originality Incremental advance

AI Analysis

This work addresses a specific challenge in adaptive control for mechanical systems, representing an incremental improvement in recurrent network design for this domain.

The paper tackles the difficulty of using recurrent neural networks for unstable systems by proposing Tustin-Net, a new recurrent network for mechanical systems, and demonstrates its application in modeling a double inverted pendulum and adaptive model predictive control with reinforcement learning.

The use of recurrent neural networks to represent the dynamics of unstable systems is difficult due to the need to properly initialize their internal states, which in most of the cases do not have any physical meaning, consequent to the non-smoothness of the optimization problem. For this reason, in this paper focus is placed on mechanical systems characterized by a number of degrees of freedom, each one represented by two states, namely position and velocity. For these systems, a new recurrent neural network is proposed: Tustin-Net. Inspired by second-order dynamics, the network hidden states can be straightforwardly estimated, as their differential relationships with the measured states are hardcoded in the forward pass. The proposed structure is used to model a double inverted pendulum and for model-based Reinforcement Learning, where an adaptive Model Predictive Controller scheme using the Unscented Kalman Filter is proposed to deal with parameter changes in the system.

View on arXiv PDF

Similar