ROAILGOct 12, 2022

Interactive Language: Talking to Robots in Real Time

arXiv:2210.06407v1329 citationsh-index: 42Has Code
Originality Highly original
AI Analysis

This work addresses the challenge of making robots more accessible and capable through natural language interaction, representing a significant advancement over prior methods with a large-scale dataset and improved performance.

The authors tackled the problem of enabling real-time natural language interaction with robots by developing a framework trained on a large dataset of language-annotated trajectories, achieving a 93.5% success rate on 87,000 unique commands for visuo-linguo-motor tasks in the real world.

We present a framework for building interactive, real-time, natural language-instructable robots in the real world, and we open source related assets (dataset, environment, benchmark, and policies). Trained with behavioral cloning on a dataset of hundreds of thousands of language-annotated trajectories, a produced policy can proficiently execute an order of magnitude more commands than previous works: specifically we estimate a 93.5% success rate on a set of 87,000 unique natural language strings specifying raw end-to-end visuo-linguo-motor skills in the real world. We find that the same policy is capable of being guided by a human via real-time language to address a wide range of precise long-horizon rearrangement goals, e.g. "make a smiley face out of blocks". The dataset we release comprises nearly 600,000 language-labeled trajectories, an order of magnitude larger than prior available datasets. We hope the demonstrated results and associated assets enable further advancement of helpful, capable, natural-language-interactable robots. See videos at https://interactive-language.github.io.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes