CV | IV | Jan 15, 2023

BuildSeg: A General Framework for the Segmentation of Buildings

arXiv:2301.06190v1 | 9 citations | h-index: 56
Originality: Synthesis-oriented
AI Analysis

This work addresses the need for more accurate building segmentation for applications like automatic mapping, but it is incremental as it applies existing models with minor enhancements.

The authors tackled building segmentation from aerial and LiDAR data by proposing BuildSeg, a general framework combining multiple data sources, achieving an IOU of 0.7902 and a boundary IOU of 0.6185 on datasets from Norway, Denmark, and France.

Building segmentation from aerial images and 3D laser scanning (LiDAR) is a challenging task due to the diversity of backgrounds, building textures, and image quality. While current research using different types of convolutional and transformer networks has considerably improved the performance on this task, even more accurate segmentation methods for buildings are desirable for applications such as automatic mapping. In this study, we propose a general framework termed BuildSeg, employing a generic approach that can be quickly applied to segment buildings. Different data sources were combined to increase generalization performance. The approach yields good results for different data sources, as shown by experiments on high-resolution multi-spectral and LiDAR imagery of cities in Norway, Denmark and France. We applied ConvNeXt- and SegFormer-based models to the high-resolution aerial image dataset from the MapAI competition. The methods achieved an IOU of 0.7902 and a boundary IOU of 0.6185. We used post-processing to account for the rectangular shape of the objects. This increased the boundary IOU from 0.6185 to 0.6189.
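The two metrics quoted in the abstract can be illustrated with a minimal sketch. It assumes binary masks represented as sets of (row, col) pixel coordinates and a boundary band obtained by repeated 4-neighbour erosion; the function names, the `depth` parameter, and the toy masks are hypothetical and are not taken from the paper or the MapAI evaluation code, which may define the boundary band differently.

```python
# Illustrative sketch only: names and the erosion-based boundary band are assumptions.

def iou(pred, truth):
    """Intersection over union of two binary masks given as sets of (row, col) pixels."""
    union = pred | truth
    if not union:
        return 1.0  # both masks empty: counted as perfect agreement (assumed convention)
    return len(pred & truth) / len(union)


def boundary(mask, depth=1):
    """Pixels of `mask` removed by `depth` rounds of 4-neighbour erosion."""
    neighbours = ((1, 0), (-1, 0), (0, 1), (0, -1))
    eroded = set(mask)
    for _ in range(depth):
        # Keep a pixel only if all four neighbours are still part of the mask.
        eroded = {
            (r, c)
            for (r, c) in eroded
            if all((r + dr, c + dc) in eroded for dr, dc in neighbours)
        }
    return mask - eroded


def boundary_iou(pred, truth, depth=1):
    """IoU restricted to the boundary bands of both masks."""
    return iou(boundary(pred, depth), boundary(truth, depth))


# Toy example: two 4x4 squares, the prediction shifted one pixel to the right.
truth = {(r, c) for r in range(4) for c in range(0, 4)}
pred = {(r, c) for r in range(4) for c in range(1, 5)}
print(iou(pred, truth), boundary_iou(pred, truth))
```

In this toy case the boundary score is lower than the plain IoU, because a one-pixel shift leaves most of the interior overlapping while misaligning much of the contour. This is consistent with the abstract reporting a lower boundary IOU than IOU, and with contour-oriented post-processing affecting mainly the boundary score.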

