CVIVJan 31, 2025

CerraData-4MM: A multimodal benchmark dataset on Cerrado for land use and land cover classification

arXiv:2502.00083v11 citationsh-index: 2Has CodeIEEE J Sel Top Appl Earth Obs Remote Sens
Originality Synthesis-oriented
AI Analysis

This provides a benchmark dataset for researchers in remote sensing and environmental monitoring to address class imbalance and multimodal fusion, but it is incremental as it applies existing methods to new data.

The authors tackled land use and land cover classification in the Cerrado by introducing CerraData-4MM, a multimodal dataset combining SAR and MSI imagery, and found that a Vision Transformer model achieved a macro F1-score of 57.60% and mIoU of 49.05% at the first hierarchical level, though both models struggled with minority classes.

The Cerrado faces increasing environmental pressures, necessitating accurate land use and land cover (LULC) mapping despite challenges such as class imbalance and visually similar categories. To address this, we present CerraData-4MM, a multimodal dataset combining Sentinel-1 Synthetic Aperture Radar (SAR) and Sentinel-2 MultiSpectral Imagery (MSI) with 10m spatial resolution. The dataset includes two hierarchical classification levels with 7 and 14 classes, respectively, focusing on the diverse Bico do Papagaio ecoregion. We highlight CerraData-4MM's capacity to benchmark advanced semantic segmentation techniques by evaluating a standard U-Net and a more sophisticated Vision Transformer (ViT) model. The ViT achieves superior performance in multimodal scenarios, with the highest macro F1-score of 57.60% and a mean Intersection over Union (mIoU) of 49.05% at the first hierarchical level. Both models struggle with minority classes, particularly at the second hierarchical level, where U-Net's performance drops to an F1-score of 18.16%. Class balancing improves representation for underrepresented classes but reduces overall accuracy, underscoring the trade-off in weighted training. CerraData-4MM offers a challenging benchmark for advancing deep learning models to handle class imbalance and multimodal data fusion. Code, trained models, and data are publicly available at https://github.com/ai4luc/CerraData-4MM.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes