CV RO Apr 12

Point2Pose: Occlusion-Recovering 6D Pose Tracking and 3D Reconstruction for Multiple Unknown Objects Via 2D Point Trackers

Tzu-Yuan Lin, Ho Jae Lee, Kevin Doherty, Yonghyeon Lee, Sangbae Kim

arXiv:2604.10415 25.01 citations h-index: 1

AI Analysis

For robotic manipulation and augmented reality, this method removes the need for object CAD models or category priors, enabling tracking of unseen objects with occlusion recovery.

Point2Pose enables model-free 6D pose tracking of multiple unknown rigid objects from monocular RGB-D video, initialized only from sparse 2D points on the objects. It achieves performance comparable to state-of-the-art methods on a severe-occlusion benchmark while also supporting multi-object tracking and recovery from complete occlusion.

We present Point2Pose, a model-free method for causal 6D pose tracking of multiple rigid objects from monocular RGB-D video. Initialized only from sparse image points on the objects to be tracked, our approach tracks multiple unseen objects without requiring object CAD models or category priors. Point2Pose leverages a 2D point tracker to obtain long-range correspondences, enabling instant recovery after complete occlusion. Simultaneously, the system incrementally reconstructs an online Truncated Signed Distance Function (TSDF) representation of the tracked targets. Alongside the method, we introduce a new multi-object tracking dataset comprising both simulation and real-world sequences, with motion-capture ground truth for evaluation. Experiments show that Point2Pose achieves performance comparable to state-of-the-art methods on a severe-occlusion benchmark, while additionally supporting multi-object tracking and recovery from complete occlusion, capabilities that previous model-free tracking approaches do not support.
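The abstract does not spell out how the TSDF is updated, so the following is only a minimal sketch of generic incremental TSDF fusion (a weighted running average per voxel), not the authors' implementation. It assumes a pinhole camera and that each voxel center has already been moved into the camera frame using the tracked object pose. All names here (`Voxel`, `backproject`, `integrate`, `trunc`, `max_weight`) are hypothetical and chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class Voxel:
    tsdf: float = 1.0    # normalized truncated signed distance, in [-1, 1]
    weight: float = 0.0  # accumulated observation weight


def backproject(u, v, depth, fx, fy, cx, cy):
    """Lift pixel (u, v) with metric depth to a 3D camera-frame point (pinhole model)."""
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return (x, y, depth)


def integrate(voxel, voxel_depth, measured_depth, trunc, max_weight=64.0):
    """Fuse one depth observation into a voxel with a weighted running average.

    voxel_depth:    depth of the voxel center in the camera frame (assumed given)
    measured_depth: sensor depth at the pixel the voxel projects to
    trunc:          truncation distance, in the same units as the depths
    """
    sdf = measured_depth - voxel_depth  # positive in front of the surface
    if sdf < -trunc:
        return voxel  # far behind the observed surface: treat as occluded, skip
    d = min(1.0, sdf / trunc)  # truncate and normalize
    new_weight = voxel.weight + 1.0
    voxel.tsdf = (voxel.tsdf * voxel.weight + d) / new_weight
    voxel.weight = min(new_weight, max_weight)  # cap so old data can be overwritten
    return voxel
```

Skipping voxels that lie far behind the measured surface is what keeps an occluder from corrupting the model: while the object is hidden, its voxels simply receive no update and the reconstruction built so far is retained.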
