CVROJan 19, 2022

Semi-automatic 3D Object Keypoint Annotation and Detection for the Masses

arXiv:2201.07665v18 citations
AI Analysis

This addresses the bottleneck of dataset creation for robotics researchers, allowing them to use learning-based methods with custom objects instead of being restricted to standardized sets like YCB.

The paper tackles the problem of limited object keypoint datasets for robotics by presenting a semi-automatic toolkit that reduces the time for data collection, annotation, and learning to just a couple of hours, enabling rapid deployment of 3D keypoint detectors.

Creating computer vision datasets requires careful planning and lots of time and effort. In robotics research, we often have to use standardized objects, such as the YCB object set, for tasks such as object tracking, pose estimation, grasping and manipulation, as there are datasets and pre-learned methods available for these objects. This limits the impact of our research since learning-based computer vision methods can only be used in scenarios that are supported by existing datasets. In this work, we present a full object keypoint tracking toolkit, encompassing the entire process from data collection, labeling, model learning and evaluation. We present a semi-automatic way of collecting and labeling datasets using a wrist mounted camera on a standard robotic arm. Using our toolkit and method, we are able to obtain a working 3D object keypoint detector and go through the whole process of data collection, annotation and learning in just a couple hours of active time.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes