CVOct 19, 2019

SpatialFlow: Bridging All Tasks for Panoptic Segmentation

arXiv:1910.08787v333 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses a crucial problem in computer vision for improving scene understanding in applications like autonomous driving, though it appears incremental as it builds on existing panoptic segmentation frameworks.

The paper tackled the problem of integrating object location information into panoptic segmentation by proposing spatial information flows to bridge all sub-tasks, achieving state-of-the-art results of 47.9 PQ on MS-COCO and 62.5 PQ on Cityscapes benchmarks.

Object location is fundamental to panoptic segmentation as it is related to all things and stuff in the image scene. Knowing the locations of objects in the image provides clues for segmenting and helps the network better understand the scene. How to integrate object location in both thing and stuff segmentation is a crucial problem. In this paper, we propose spatial information flows to achieve this objective. The flows can bridge all sub-tasks in panoptic segmentation by delivering the object's spatial context from the box regression task to others. More importantly, we design four parallel sub-networks to get a preferable adaptation of object spatial information in sub-tasks. Upon the sub-networks and the flows, we present a location-aware and unified framework for panoptic segmentation, denoted as SpatialFlow. We perform a detailed ablation study on each component and conduct extensive experiments to prove the effectiveness of SpatialFlow. Furthermore, we achieve state-of-the-art results, which are $47.9$ PQ and $62.5$ PQ respectively on MS-COCO and Cityscapes panoptic benchmarks. Code will be available at https://github.com/chensnathan/SpatialFlow.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes