CVAug 12, 2024

CMAB: A First National-Scale Multi-Attribute Building Dataset in China Derived from Open Source Data and GeoAI

arXiv:2408.05891v31 citationsh-index: 4
Originality Synthesis-oriented
AI Analysis

This dataset addresses the need for comprehensive building data for urban analysis and planning, particularly for global SDGs, but is incremental as it applies existing GeoAI methods to a new large-scale context.

The paper tackled the problem of incomplete multi-attribute building data by introducing the first national-scale Multi-Attribute Building dataset (CMAB) in China, covering 29 million buildings with attributes like rooftop, height, function, age, and quality, achieving an F1-Score of 89.93% in extraction and validation accuracies mostly above 80%.

Rapidly acquiring three-dimensional (3D) building data, including geometric attributes like rooftop, height and orientations, as well as indicative attributes like function, quality, and age, is essential for accurate urban analysis, simulations, and policy updates. Current building datasets suffer from incomplete coverage of building multi-attributes. This paper introduces a geospatial artificial intelligence (GeoAI) framework for large-scale building modeling, presenting the first national-scale Multi-Attribute Building dataset (CMAB), covering 3,667 spatial cities, 29 million buildings, and 21.3 billion square meters of rooftops with an F1-Score of 89.93% in OCRNet-based extraction, totaling 337.7 billion cubic meters of building stock. We trained bootstrap aggregated XGBoost models with city administrative classifications, incorporating features such as morphology, location, and function. Using multi-source data, including billions of high-resolution Google Earth images and 60 million street view images (SVIs), we generated rooftop, height, function, age, and quality attributes for each building. Accuracy was validated through model benchmarks, existing similar products, and manual SVI validation, mostly above 80%. Our dataset and results are crucial for global SDGs and urban planning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes