CVROOct 23, 2019

6-PACK: Category-level 6D Pose Tracker with Anchor-Based Keypoints

arXiv:1910.10750v1174 citationsHas Code
Originality Highly original
AI Analysis

This addresses the problem of robust object tracking for robotics and vision applications, offering a novel approach with strong performance gains.

The paper tackles category-level 6D object pose tracking on RGB-D data by learning 3D keypoints end-to-end without manual supervision, achieving real-time tracking and substantially outperforming existing methods on the NOCS benchmark.

We present 6-PACK, a deep learning approach to category-level 6D object pose tracking on RGB-D data. Our method tracks in real-time novel object instances of known object categories such as bowls, laptops, and mugs. 6-PACK learns to compactly represent an object by a handful of 3D keypoints, based on which the interframe motion of an object instance can be estimated through keypoint matching. These keypoints are learned end-to-end without manual supervision in order to be most effective for tracking. Our experiments show that our method substantially outperforms existing methods on the NOCS category-level 6D pose estimation benchmark and supports a physical robot to perform simple vision-based closed-loop manipulation tasks. Our code and video are available at https://sites.google.com/view/6packtracking.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes