CVJul 28, 2025

SCANet: Split Coordinate Attention Network for Building Footprint Extraction

arXiv:2507.20809v11 citationsh-index: 3Has CodeICONIP
Originality Incremental advance
AI Analysis

This work addresses building footprint extraction for applications in urban planning and environmental monitoring, representing an incremental improvement over existing deep learning methods.

The paper tackles building footprint extraction from remote sensing images by proposing a novel plug-and-play attention module called Split Coordinate Attention (SCA), which integrates into a 2D CNN to form SCANet, achieving state-of-the-art results with IoU scores of 91.61% and 75.49% on two public datasets.

Building footprint extraction holds immense significance in remote sensing image analysis and has great value in urban planning, land use, environmental protection and disaster assessment. Despite the progress made by conventional and deep learning approaches in this field, they continue to encounter significant challenges. This paper introduces a novel plug-and-play attention module, Split Coordinate Attention (SCA), which ingeniously captures spatially remote interactions by employing two spatial range of pooling kernels, strategically encoding each channel along x and y planes, and separately performs a series of split operations for each feature group, thus enabling more efficient semantic feature extraction. By inserting into a 2D CNN to form an effective SCANet, our SCANet outperforms recent SOTA methods on the public Wuhan University (WHU) Building Dataset and Massachusetts Building Dataset in terms of various metrics. Particularly SCANet achieves the best IoU, 91.61% and 75.49% for the two datasets. Our code is available at https://github.com/AiEson/SCANet

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes