CVApr 7, 2017

Pixelwise Instance Segmentation with a Dynamically Instantiated Network

arXiv:1704.02386v1241 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the need for precise pixel-level instance segmentation in computer vision, offering an incremental improvement over existing methods.

The paper tackles the problem of instance segmentation by proposing a system that assigns each pixel an object class and instance identity, achieving state-of-the-art results on Pascal VOC and Cityscapes datasets with high IoU thresholds.

Semantic segmentation and object detection research have recently achieved rapid progress. However, the former task has no notion of different instances of the same object, and the latter operates at a coarse, bounding-box level. We propose an Instance Segmentation system that produces a segmentation map where each pixel is assigned an object class and instance identity label. Most approaches adapt object detectors to produce segments instead of boxes. In contrast, our method is based on an initial semantic segmentation module, which feeds into an instance subnetwork. This subnetwork uses the initial category-level segmentation, along with cues from the output of an object detector, within an end-to-end CRF to predict instances. This part of our model is dynamically instantiated to produce a variable number of instances per image. Our end-to-end approach requires no post-processing and considers the image holistically, instead of processing independent proposals. Therefore, unlike some related work, a pixel cannot belong to multiple instances. Furthermore, far more precise segmentations are achieved, as shown by our state-of-the-art results (particularly at high IoU thresholds) on the Pascal VOC and Cityscapes datasets.

View on arXiv PDF Code

Similar