CVROJan 28, 2022

Mobile Robot Manipulation using Pure Object Detection

arXiv:2201.12437v214 citationsHas Code
AI Analysis

This addresses the problem of enabling mobile robots to manipulate objects in cluttered environments with low annotation costs, though it appears incremental as it builds on existing detection and control methods.

The paper tackles mobile robot manipulation by developing an end-to-end method based solely on object detection, introducing Task-focused Few-shot Object Detection (TFOD) to learn new objects with minimal annotation (e.g., as few as four clicks). It achieves state-of-the-art results on a visual servo control and depth estimation benchmark and creates a new benchmark for robotics object detection.

This paper addresses the problem of mobile robot manipulation using object detection. Our approach uses detection and control as complimentary functions that learn from real-world interactions. We develop an end-to-end manipulation method based solely on detection and introduce Task-focused Few-shot Object Detection (TFOD) to learn new objects and settings. Our robot collects its own training data and automatically determines when to retrain detection to improve performance across various subtasks (e.g., grasping). Notably, detection training is low-cost, and our robot learns to manipulate new objects using as few as four clicks of annotation. In physical experiments, our robot learns visual control from a single click of annotation and a novel update formulation, manipulates new objects in clutter and other mobile settings, and achieves state-of-the-art results on an existing visual servo control and depth estimation benchmark. Finally, we develop a TFOD Benchmark to support future object detection research for robotics: https://github.com/griffbr/tfod.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes