CVLGROIVApr 1, 2020

EPOS: Estimating 6D Pose of Objects with Symmetries

arXiv:2004.00605v1273 citations
Originality Highly original
AI Analysis

This addresses the challenge of object pose estimation for robotics and computer vision, particularly for objects with symmetries, representing a strong specific gain rather than a foundational advance.

The paper tackles the problem of estimating 6D pose of rigid objects with symmetries from a single RGB image, achieving state-of-the-art results on datasets like T-LESS, LM-O, and YCB-V, with a large margin over competitors in some cases.

We present a new method for estimating the 6D pose of rigid objects with available 3D models from a single RGB input image. The method is applicable to a broad range of objects, including challenging ones with global or partial symmetries. An object is represented by compact surface fragments which allow handling symmetries in a systematic manner. Correspondences between densely sampled pixels and the fragments are predicted using an encoder-decoder network. At each pixel, the network predicts: (i) the probability of each object's presence, (ii) the probability of the fragments given the object's presence, and (iii) the precise 3D location on each fragment. A data-dependent number of corresponding 3D locations is selected per pixel, and poses of possibly multiple object instances are estimated using a robust and efficient variant of the PnP-RANSAC algorithm. In the BOP Challenge 2019, the method outperforms all RGB and most RGB-D and D methods on the T-LESS and LM-O datasets. On the YCB-V dataset, it is superior to all competitors, with a large margin over the second-best RGB method. Source code is at: cmp.felk.cvut.cz/epos.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes