ROAICVLGJul 24, 2018

ClusterNet: 3D Instance Segmentation in RGB-D Images

arXiv:1807.08894v210 citations
AI Analysis

This addresses instance-level scene understanding for autonomous robots, enabling safer decision-making, but it appears incremental as it builds on existing instance segmentation methods with a novel clustering approach.

The paper tackles 3D instance segmentation in RGB-D images for autonomous robots by proposing a method that uses moments of object occupancy and a DNN to vote for object centers, sizes, and poses, achieving state-of-the-art performance on a synthesized dataset and better visual results on real-world data.

We propose a method for instance-level segmentation that uses RGB-D data as input and provides detailed information about the location, geometry and number of individual objects in the scene. This level of understanding is fundamental for autonomous robots. It enables safe and robust decision-making under the large uncertainty of the real-world. In our model, we propose to use the first and second order moments of the object occupancy function to represent an object instance. We train an hourglass Deep Neural Network (DNN) where each pixel in the output votes for the 3D position of the corresponding object center and for the object's size and pose. The final instance segmentation is achieved through clustering in the space of moments. The object-centric training loss is defined on the output of the clustering. Our method outperforms the state-of-the-art instance segmentation method on our synthesized dataset. We show that our method generalizes well on real-world data achieving visually better segmentation results.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes