ROMay 11, 2019

Fast Skill Learning for Variable Compliance Robotic Assembly

arXiv:1905.04427v12 citations
Originality Incremental advance
AI Analysis

This work addresses efficiency and safety challenges in robotic assembly for industrial automation, representing an incremental improvement over existing methods.

The paper tackles the sample complexity problem in reinforcement learning for robotic assembly by incorporating a Kalman filter-based model fitting strategy into model-guided exploration, achieving accelerated learning while maintaining model-free benefits for uncertain environments, and also addresses robot overload by restricting potential energy in compliance control to keep contact forces within safe limits.

The robotic assembly represents a group of benchmark problems for reinforcement learning and variable compliance control that features sophisticated contact manipulation. One of the key challenges in applying reinforcement learning to physical robot is the sample complexity, the requirement of large amounts of experience for learning. We mitigate this sample complexity problem by incorporating an iteratively refitted model into the learning process through model-guided exploration. Yet, fitting a local model of the physical environment is of major difficulties. In this work, a Kalman filter is used to combine the adaptive linear dynamics with a coarse prior model from analytical description, and proves to give more accurate predictions than the existing method. Experimental results show that the proposed model fitting strategy can be incorporated into a model predictive controller to generate good exploration behaviors for learning acceleration, while preserving the benefits of model-free reinforcement learning for uncertain environments. In addition to the sample complexity, the inevitable robot overloaded during operation also tends to limit the learning efficiency. To address this problem, we present a method to restrict the largest possible potential energy in the compliance control system and therefore keep the contact force within the legitimate range.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes