Robust 6D Object Pose Estimation with Stochastic Congruent Sets
This work improves pose estimation for robotics and computer vision applications, though it is incremental as it builds on existing segmentation and registration techniques.
The paper tackles the problem of 6D object pose estimation by addressing noisy segmentation from CNNs, proposing a stochastic optimization method that uses segmentation confidence to sample pointsets and match them to 3D models, resulting in robust performance on an APC dataset and outperforming recent methods on the YCB dataset.
Object pose estimation is frequently achieved by first segmenting an RGB image and then, given depth data, registering the corresponding point cloud segment against the object's 3D model. Despite the progress due to CNNs, semantic segmentation output can be noisy, especially when the CNN is only trained on synthetic data. This causes registration methods to fail in estimating a good object pose. This work proposes a novel stochastic optimization process that treats the segmentation output of CNNs as a confidence probability. The algorithm, called Stochastic Congruent Sets (StoCS), samples pointsets on the point cloud according to the soft segmentation distribution and so as to agree with the object's known geometry. The pointsets are then matched to congruent sets on the 3D object model to generate pose estimates. StoCS is shown to be robust on an APC dataset, despite the fact the CNN is trained only on synthetic data. In the YCB dataset, StoCS outperforms a recent network for 6D pose estimation and alternative pointset matching techniques.