CVDec 11, 2024

SenCLIP: Enhancing zero-shot land-use mapping for Sentinel-2 with ground-level prompting

arXiv:2412.08536v115 citationsh-index: 22WACV
Originality Incremental advance
AI Analysis

This work addresses the challenge of zero-shot land-use mapping for remote sensing applications, representing an incremental advancement by adapting existing methods to a specific domain.

The paper tackled the problem of limited zero-shot classification performance of vision-language models on satellite imagery by introducing SenCLIP, which leverages ground-level photos to enhance CLIP's representation for Sentinel-2 data, resulting in significant accuracy improvements on EuroSAT and BigEarthNet datasets.

Pre-trained vision-language models (VLMs), such as CLIP, demonstrate impressive zero-shot classification capabilities with free-form prompts and even show some generalization in specialized domains. However, their performance on satellite imagery is limited due to the underrepresentation of such data in their training sets, which predominantly consist of ground-level images. Existing prompting techniques for satellite imagery are often restricted to generic phrases like a satellite image of ..., limiting their effectiveness for zero-shot land-use and land-cover (LULC) mapping. To address these challenges, we introduce SenCLIP, which transfers CLIPs representation to Sentinel-2 imagery by leveraging a large dataset of Sentinel-2 images paired with geotagged ground-level photos from across Europe. We evaluate SenCLIP alongside other SOTA remote sensing VLMs on zero-shot LULC mapping tasks using the EuroSAT and BigEarthNet datasets with both aerial and ground-level prompting styles. Our approach, which aligns ground-level representations with satellite imagery, demonstrates significant improvements in classification accuracy across both prompt styles, opening new possibilities for applying free-form textual descriptions in zero-shot LULC mapping.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes