CVApr 4, 2020

Pixel Consensus Voting for Panoptic Segmentation

arXiv:2004.01849v140 citationsHas Code
AI Analysis

This work addresses the problem of efficient and unified instance segmentation for computer vision applications, presenting an incremental improvement over existing methods.

The paper tackles panoptic segmentation by introducing Pixel Consensus Voting, a framework that uses pixel-wise voting for instance centroids and backprojection to generate masks, achieving competitive results on COCO and Cityscapes datasets.

The core of our approach, Pixel Consensus Voting, is a framework for instance segmentation based on the Generalized Hough transform. Pixels cast discretized, probabilistic votes for the likely regions that contain instance centroids. At the detected peaks that emerge in the voting heatmap, backprojection is applied to collect pixels and produce instance masks. Unlike a sliding window detector that densely enumerates object proposals, our method detects instances as a result of the consensus among pixel-wise votes. We implement vote aggregation and backprojection using native operators of a convolutional neural network. The discretization of centroid voting reduces the training of instance segmentation to pixel labeling, analogous and complementary to FCN-style semantic segmentation, leading to an efficient and unified architecture that jointly models things and stuff. We demonstrate the effectiveness of our pipeline on COCO and Cityscapes Panoptic Segmentation and obtain competitive results. Code will be open-sourced.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes