CVMar 11, 2021

PointFlow: Flowing Semantics Through Points for Aerial Image Segmentation

arXiv:2103.06564v1107 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses segmentation problems in aerial imagery, which is crucial for applications like remote sensing and urban planning, by introducing a novel approach to handle imbalanced distributions and small objects, though it builds incrementally on existing FPN frameworks.

The paper tackles the challenges of aerial image segmentation, such as extreme foreground-background imbalance and multiple small objects, by proposing PointFlow, a point-wise affinity propagation module that uses sparse affinity maps to reduce background noise. The method achieves state-of-the-art speed and accuracy trade-offs on three aerial benchmarks and demonstrates generality on general semantic segmentation datasets.

Aerial Image Segmentation is a particular semantic segmentation problem and has several challenging characteristics that general semantic segmentation does not have. There are two critical issues: The one is an extremely foreground-background imbalanced distribution, and the other is multiple small objects along with the complex background. Such problems make the recent dense affinity context modeling perform poorly even compared with baselines due to over-introduced background context. To handle these problems, we propose a point-wise affinity propagation module based on the Feature Pyramid Network (FPN) framework, named PointFlow. Rather than dense affinity learning, a sparse affinity map is generated upon selected points between the adjacent features, which reduces the noise introduced by the background while keeping efficiency. In particular, we design a dual point matcher to select points from the salient area and object boundaries, respectively. Experimental results on three different aerial segmentation datasets suggest that the proposed method is more effective and efficient than state-of-the-art general semantic segmentation methods. Especially, our methods achieve the best speed and accuracy trade-off on three aerial benchmarks. Further experiments on three general semantic segmentation datasets prove the generality of our method. Code will be provided in (https: //github.com/lxtGH/PFSegNets).

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes