CV · Mar 18, 2025

FusDreamer: Label-efficient Remote Sensing World Model for Multimodal Data Classification

arXiv:2503.13814v1 · 6 citations · h-index: 27 · Has Code · IEEE Trans Geosci Remote Sens
Originality: Incremental advance
AI Analysis

The paper addresses the problem of limited labeled data in multimodal remote sensing classification, though it appears incremental because it builds on existing world model and fusion techniques.

It proposes FusDreamer, a label-efficient world model that fuses hyperspectral, LiDAR, and text data for classification, and supports its effectiveness with experiments on four datasets.

World models significantly enhance hierarchical understanding, improving data integration and learning efficiency. To explore the potential of world models in the remote sensing (RS) field, this paper proposes FusDreamer, a label-efficient remote sensing world model for multimodal data fusion. FusDreamer uses the world model as a unified representation container to abstract common and high-level knowledge, promoting interactions across different types of data, i.e., hyperspectral (HSI), light detection and ranging (LiDAR), and text data. First, a new latent diffusion fusion and multimodal generation paradigm (LaMG) is used for its information integration and detail retention capabilities. Next, an open-world knowledge-guided consistency projection (OK-CP) module incorporates prompt representations for visually described objects and aligns language-visual features through contrastive learning. In this way, the domain gap can be bridged by fine-tuning the pre-trained world models with limited samples. Finally, an end-to-end multitask combinatorial optimization (MuCO) strategy captures slight feature bias and constrains the diffusion process in a collaboratively learnable direction. Experiments conducted on four typical datasets indicate the effectiveness and advantages of the proposed FusDreamer. The corresponding code will be released at https://github.com/Cimy-wang/FusDreamer.
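
The abstract says that OK-CP aligns language-visual features through contrastive learning but does not give the loss. As a rough illustration of that general idea only, the sketch below computes a symmetric InfoNCE-style loss over paired visual and text embeddings. It is not the authors' implementation: the function names, the temperature value of 0.07, and the assumption that row i of each input forms a matched pair are all hypothetical choices made for this example.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def log_softmax(row):
    """Numerically stable log-softmax over one row of logits."""
    peak = max(row)
    log_sum = peak + math.log(sum(math.exp(x - peak) for x in row))
    return [x - log_sum for x in row]


def contrastive_alignment_loss(visual_feats, text_feats, temperature=0.07):
    """Symmetric InfoNCE-style loss (hypothetical helper, not from the paper).

    Assumes visual_feats[i] and text_feats[i] describe the same sample.
    """
    v = [l2_normalize(f) for f in visual_feats]
    t = [l2_normalize(f) for f in text_feats]
    n = len(v)
    # Cosine similarity between every visual/text pair, scaled by temperature.
    logits = [[sum(a * b for a, b in zip(vi, tj)) / temperature for tj in t]
              for vi in v]
    # Visual-to-text direction: the correct text for row i is column i.
    v2t = -sum(log_softmax(logits[i])[i] for i in range(n)) / n
    # Text-to-visual direction: same computation on the transposed logits.
    logits_t = [list(col) for col in zip(*logits)]
    t2v = -sum(log_softmax(logits_t[i])[i] for i in range(n)) / n
    return 0.5 * (v2t + t2v)


# Toy embeddings: matched pairs should score a lower loss than swapped pairs.
visual = [[1.0, 0.0], [0.0, 1.0]]
text_matched = [[1.0, 0.0], [0.0, 1.0]]
text_swapped = [[0.0, 1.0], [1.0, 0.0]]
loss_matched = contrastive_alignment_loss(visual, text_matched)
loss_swapped = contrastive_alignment_loss(visual, text_swapped)
```

On these toy inputs the matched pairing yields a much smaller loss than the swapped pairing, which is the property a contrastive objective relies on to pull corresponding language and visual features together.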

Code Implementations: 1 repo