CVMar 15, 2022

GPV-Pose: Category-level Object Pose Estimation via Geometry-guided Point-wise Voting

arXiv:2203.07918v2174 citationsh-index: 58
AI Analysis

It addresses the problem of limited applications in robotics and AR/VR by enabling pose estimation for diverse object categories, though it is incremental with novel components building on existing category-level methods.

The paper tackles category-level 6D object pose estimation for unseen instances by proposing GPV-Pose, which uses geometry-guided point-wise voting and decoupled rotation representation to handle intra-class shape variations, achieving state-of-the-art results on benchmarks with near real-time inference at 20 FPS.

While 6D object pose estimation has recently made a huge leap forward, most methods can still only handle a single or a handful of different objects, which limits their applications. To circumvent this problem, category-level object pose estimation has recently been revamped, which aims at predicting the 6D pose as well as the 3D metric size for previously unseen instances from a given set of object classes. This is, however, a much more challenging task due to severe intra-class shape variations. To address this issue, we propose GPV-Pose, a novel framework for robust category-level pose estimation, harnessing geometric insights to enhance the learning of category-level pose-sensitive features. First, we introduce a decoupled confidence-driven rotation representation, which allows geometry-aware recovery of the associated rotation matrix. Second, we propose a novel geometry-guided point-wise voting paradigm for robust retrieval of the 3D object bounding box. Finally, leveraging these different output streams, we can enforce several geometric consistency terms, further increasing performance, especially for non-symmetric categories. GPV-Pose produces superior results to state-of-the-art competitors on common public benchmarks, whilst almost achieving real-time inference speed at 20 FPS.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes