ROLGFeb 25, 2021

Efficient and Interpretable Robot Manipulation with Graph Neural Networks

arXiv:2102.13177v451 citations
Originality Incremental advance
AI Analysis

This addresses the problem of enabling robots to perform complex long-horizon manipulation tasks without detailed task descriptions, though it is incremental as it builds on existing GNN and imitation learning methods.

The paper tackles robot manipulation tasks by framing them as a classification problem over a graph using a graph neural network (GNN) policy trained with imitation learning on 20 expert demonstrations, achieving generalization to more objects, goal configurations, and from simulation to real-world tasks like blockstacking and dishwasher loading.

Manipulation tasks, like loading a dishwasher, can be seen as a sequence of spatial constraints and relationships between different objects. We aim to discover these rules from demonstrations by posing manipulation as a classification problem over a graph, whose nodes represent task-relevant entities like objects and goals, and present a graph neural network (GNN) policy architecture for solving this problem from demonstrations. In our experiments, a single GNN policy trained using imitation learning (IL) on 20 expert demos can solve blockstacking, rearrangement, and dishwasher loading tasks; once the policy has learned the spatial structure, it can generalize to a larger number of objects, goal configurations, and from simulation to the real world. These experiments show that graphical IL can solve complex long-horizon manipulation problems without requiring detailed task descriptions. Videos can be found at: https://youtu.be/POxaTDAj7aY.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes