CVROSep 22, 2021

NudgeSeg: Zero-Shot Object Segmentation by Repeated Physical Interaction

arXiv:2109.13859v18 citations
Originality Highly original
AI Analysis

This addresses the limitation of deep neural networks in generalizing to unseen objects for robotics applications, offering a novel interactive approach.

The paper tackles the problem of segmenting never-before-seen objects in cluttered scenes by using a robot to repeatedly nudge objects and gather motion cues, achieving an average detection rate of over 86% on zero-shot objects.

Recent advances in object segmentation have demonstrated that deep neural networks excel at object segmentation for specific classes in color and depth images. However, their performance is dictated by the number of classes and objects used for training, thereby hindering generalization to never seen objects or zero-shot samples. To exacerbate the problem further, object segmentation using image frames rely on recognition and pattern matching cues. Instead, we utilize the 'active' nature of a robot and their ability to 'interact' with the environment to induce additional geometric constraints for segmenting zero-shot samples. In this paper, we present the first framework to segment unknown objects in a cluttered scene by repeatedly 'nudging' at the objects and moving them to obtain additional motion cues at every step using only a monochrome monocular camera. We call our framework NudgeSeg. These motion cues are used to refine the segmentation masks. We successfully test our approach to segment novel objects in various cluttered scenes and provide an extensive study with image and motion segmentation methods. We show an impressive average detection rate of over 86% on zero-shot objects.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes