CVSep 5, 2019

Semantic Correlation Promoted Shape-Variant Context for Segmentation

arXiv:1909.02651v1179 citations
Originality Highly original
AI Analysis

This addresses the challenge of handling diverse object shapes and layouts in scene images for segmentation tasks, representing a novel method rather than an incremental improvement.

The authors tackled the problem of aggregating context for semantic segmentation by generating scale- and shape-variant semantic masks for each pixel, leading to state-of-the-art results on six public datasets.

Context is essential for semantic segmentation. Due to the diverse shapes of objects and their complex layout in various scene images, the spatial scales and shapes of contexts for different objects have very large variation. It is thus ineffective or inefficient to aggregate various context information from a predefined fixed region. In this work, we propose to generate a scale- and shape-variant semantic mask for each pixel to confine its contextual region. To this end, we first propose a novel paired convolution to infer the semantic correlation of the pair and based on that to generate a shape mask. Using the inferred spatial scope of the contextual region, we propose a shape-variant convolution, of which the receptive field is controlled by the shape mask that varies with the appearance of input. In this way, the proposed network aggregates the context information of a pixel from its semantic-correlated region instead of a predefined fixed region. Furthermore, this work also proposes a labeling denoising model to reduce wrong predictions caused by the noisy low-level features. Without bells and whistles, the proposed segmentation network achieves new state-of-the-arts consistently on the six public segmentation datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes