CVMay 28

OmniCD: A Foundational Framework for Remote Sensing Image Change Detection Guided by Multimodal Semantics

arXiv:2605.3016839.3
Predicted impact top 79% in CV · last 90 daysOriginality Highly original
AI Analysis

For remote sensing practitioners, OmniCD provides a general-purpose change detection system that generalizes across diverse scenarios, addressing the bottleneck of poor cross-domain robustness.

OmniCD introduces a foundational framework for remote sensing change detection that leverages multimodal semantic guidance (text, semantic maps, geospatial metadata) to unify tasks from binary CD to zero-shot semantic change. It achieves state-of-the-art performance across benchmarks, supported by a new large-scale dataset RSITCD with 300K+ annotated pairs.

Change detection (CD) in remote sensing is vital for applications such as urban monitoring and disaster assessment, yet traditional methods struggle with generalization across diverse scenarios. We present OmniCD, a foundational framework that unifies and enhances remote sensing CD through multimodal semantic guidance. OmniCD incorporates image and text prompts -- such as textual descriptions, semantic maps, and geospatial metadata -- into a unified architecture, supporting tasks from binary CD to zero-shot semantic change understanding. The framework integrates a hierarchical scene retrieval module and a change detection module, reinforced by a style disentanglement mechanism for improved cross-domain robustness. We further introduce RSITCD, a large-scale multimodal dataset with 300K+ annotated image-text pairs. Extensive experiments show that OmniCD achieves state-of-the-art performance across benchmarks, demonstrating strong adaptability and setting a solid foundation for general-purpose CD systems in remote sensing.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes