ROAug 1, 2016

DOOMED: Direct Online Optimization of Modeling Errors in Dynamics

Nathan Ratliff, Franziska Meier, Daniel Kappler, Stefan Schaal

arXiv:1608.00309v211.720 citations

Originality Incremental advance

AI Analysis

This work addresses a key bottleneck in robotics and control systems by enabling more accurate and compliant motion, though it is an incremental improvement over existing adaptive control methods.

The paper tackles the problem of inaccurate inverse dynamics models in model-based control by introducing DOOMED, a gradient-based online learning algorithm that directly minimizes the divergence between actual and desired accelerations, achieving improved tracking performance in real-time without requiring prior accurate models.

It has long been hoped that model-based control will improve tracking performance while maintaining or increasing compliance. This hope hinges on having or being able to estimate an accurate inverse dynamics model. As a result, substantial effort has gone into modeling and estimating dynamics (error) models. Most recent research has focused on learning the true inverse dynamics using data points mapping observed accelerations to the torques used to generate them. Unfortunately, if the initial tracking error is bad, such learning processes may train substantially off-distribution to predict well on actual observed acceleration rather then the desired accelerations. This work takes a different approach. We define a class of gradient-based online learning algorithms we term Direct Online Optimization for Modeling Errors in Dynamics (DOOMED) that directly minimize an objective measuring the divergence between actual and desired accelerations. Our objective is defined in terms of the true system's unknown dynamics and is therefore impossible to evaluate. However, we show that its gradient is measurable online from system data. We develop a novel adaptive control approach based on running online learning to directly correct (inverse) dynamics errors in real time using the data stream from the robot to accurately achieve desired accelerations during execution.

View on arXiv PDF

Similar