CVOct 8, 2018

Robust 6D Object Pose Estimation in Cluttered Scenes using Semantic Segmentation and Pose Regression Networks

Arul Selvam Periyasamy, Max Schwarz, Sven Behnke

arXiv:1810.03410v17.320 citations

Originality Synthesis-oriented

AI Analysis

This work addresses a crucial need for robots in autonomous manipulation in dynamic environments like warehouses, but it is incremental as it builds on prior semantic segmentation methods.

The paper tackles the problem of 6D object pose estimation in cluttered scenes, such as warehouses, by proposing a pipeline that reduces reliance on pre-existing 3D models through synthetic data generation and semantic segmentation, achieving evaluation on both synthetic and real-world datasets.

Object pose estimation is a crucial prerequisite for robots to perform autonomous manipulation in clutter. Real-world bin-picking settings such as warehouses present additional challenges, e.g., new objects are added constantly. Most of the existing object pose estimation methods assume that 3D models of the objects is available beforehand. We present a pipeline that requires minimal human intervention and circumvents the reliance on the availability of 3D models by a fast data acquisition method and a synthetic data generation procedure. This work builds on previous work on semantic segmentation of cluttered bin-picking scenes to isolate individual objects in clutter. An additional network is trained on synthetic scenes to estimate object poses from a cropped object-centered encoding extracted from the segmentation results. The proposed method is evaluated on a synthetic validation dataset and cluttered real-world scenes.

View on arXiv PDF

Similar