CVAIJul 23, 2023

Expediting Building Footprint Extraction from High-resolution Remote Sensing Images via progressive lenient supervision

arXiv:2307.12220v2h-index: 81
Originality Incremental advance
AI Analysis

This work addresses computational bottlenecks in remote sensing for applications like urban planning, though it is incremental as it builds on existing encoder-decoder architectures.

The paper tackles the problem of inefficient model transfer in building footprint segmentation from remote sensing images by proposing BFSeg, a framework with a densely-connected coarse-to-fine decoder and lenient deep supervision, which achieves superior performance and efficiency across various encoder networks.

The efficacy of building footprint segmentation from remotely sensed images has been hindered by model transfer effectiveness. Many existing building segmentation methods were developed upon the encoder-decoder architecture of U-Net, in which the encoder is finetuned from the newly developed backbone networks that are pre-trained on ImageNet. However, the heavy computational burden of the existing decoder designs hampers the successful transfer of these modern encoder networks to remote sensing tasks. Even the widely-adopted deep supervision strategy fails to mitigate these challenges due to its invalid loss in hybrid regions where foreground and background pixels are intermixed. In this paper, we conduct a comprehensive evaluation of existing decoder network designs for building footprint segmentation and propose an efficient framework denoted as BFSeg to enhance learning efficiency and effectiveness. Specifically, a densely-connected coarse-to-fine feature fusion decoder network that facilitates easy and fast feature fusion across scales is proposed. Moreover, considering the invalidity of hybrid regions in the down-sampled ground truth during the deep supervision process, we present a lenient deep supervision and distillation strategy that enables the network to learn proper knowledge from deep supervision. Building upon these advancements, we have developed a new family of building segmentation networks, which consistently surpass prior works with outstanding performance and efficiency across a wide range of newly developed encoder networks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes