CVDec 23, 2024

URoadNet: Dual Sparse Attentive U-Net for Multiscale Road Network Extraction

arXiv:2412.17573v11 citationsh-index: 13
Originality Incremental advance
AI Analysis

This addresses road extraction from remote sensing images for applications like mapping, but it is incremental as it builds on U-Net with attention mechanisms.

The paper tackled road network segmentation by introducing URoadNet, a dual sparse attentive U-Net that encodes local connectivity and global topology, outperforming state-of-the-art methods on multiple datasets like Massachusetts and DeepGlobe.

The challenges of road network segmentation demand an algorithm capable of adapting to the sparse and irregular shapes, as well as the diverse context, which often leads traditional encoding-decoding methods and simple Transformer embeddings to failure. We introduce a computationally efficient and powerful framework for elegant road-aware segmentation. Our method, called URoadNet, effectively encodes fine-grained local road connectivity and holistic global topological semantics while decoding multiscale road network information. URoadNet offers a novel alternative to the U-Net architecture by integrating connectivity attention, which can exploit intra-road interactions across multi-level sampling features with reduced computational complexity. This local interaction serves as valuable prior information for learning global interactions between road networks and the background through another integrality attention mechanism. The two forms of sparse attention are arranged alternatively and complementarily, and trained jointly, resulting in performance improvements without significant increases in computational complexity. Extensive experiments on various datasets with different resolutions, including Massachusetts, DeepGlobe, SpaceNet, and Large-Scale remote sensing images, demonstrate that URoadNet outperforms state-of-the-art techniques. Our approach represents a significant advancement in the field of road network extraction, providing a computationally feasible solution that achieves high-quality segmentation results.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes