End-to-End Probabilistic Geometry-Guided Regression for 6DoF Object Pose Estimation
This addresses the ill-posed nature of pose estimation for XR applications by providing multiple plausible pose candidates, though it is incremental as it builds on an existing method.
The paper tackled the problem of 6D object pose estimation by reformulating a state-of-the-art algorithm to predict a probability density distribution of poses instead of a single pose, demonstrating superior quantitative results on datasets like LM-O, YCB-V, and ITODD.
6D object pose estimation is the problem of identifying the position and orientation of an object relative to a chosen coordinate system, which is a core technology for modern XR applications. State-of-the-art 6D object pose estimators directly predict an object pose given an object observation. Due to the ill-posed nature of the pose estimation problem, where multiple different poses can correspond to a single observation, generating additional plausible estimates per observation can be valuable. To address this, we reformulate the state-of-the-art algorithm GDRNPP and introduce EPRO-GDR (End-to-End Probabilistic Geometry-Guided Regression). Instead of predicting a single pose per detection, we estimate a probability density distribution of the pose. Using the evaluation procedure defined by the BOP (Benchmark for 6D Object Pose Estimation) Challenge, we test our approach on four of its core datasets and demonstrate superior quantitative results for EPRO-GDR on LM-O, YCB-V, and ITODD. Our probabilistic solution shows that predicting a pose distribution instead of a single pose can improve state-of-the-art single-view pose estimation while providing the additional benefit of being able to sample multiple meaningful pose candidates.