ROLGNov 22, 2016

Can Co-robots Learn to Teach?

arXiv:1611.07490v12 citations
Originality Highly original
AI Analysis

This work addresses the challenge of human-robot collaboration in complex tasks, offering a novel method for robots to teach humans, though it is incremental in building upon existing learning from demonstration techniques.

The paper tackles the problem of enabling robots to autonomously learn instructional policies from expert demonstrations to teach humans complex tasks in uncertain environments, proposing a Dirichlet process-based non-parametric Inverse Reinforcement Learning approach that clusters task spaces into subgoals and introduces action primitives for communication, with evaluation on a hydraulic excavator model showing effective teaching strategies.

We explore beyond existing work on learning from demonstration by asking the question: Can robots learn to teach?, that is, can a robot autonomously learn an instructional policy from expert demonstration and use it to instruct or collaborate with humans in executing complex tasks in uncertain environments? In this paper we pursue a solution to this problem by leveraging the idea that humans often implicitly decompose a higher level task into several subgoals whose execution brings the task closer to completion. We propose Dirichlet process based non-parametric Inverse Reinforcement Learning (DPMIRL) approach for reward based unsupervised clustering of task space into subgoals. This approach is shown to capture the latent subgoals that a human teacher would have utilized to train a novice. The notion of action primitive is introduced as the means to communicate instruction policy to humans in the least complicated manner, and as a computationally efficient tool to segment demonstration data. We evaluate our approach through experiments on hydraulic actuated scaled model of an excavator and evaluate and compare different teaching strategies utilized by the robot.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes