CVJul 15, 2025

Graph Aggregation Prototype Learning for Semantic Change Detection in Remote Sensing

arXiv:2507.10938v11 citationsh-index: 15
Originality Incremental advance
AI Analysis

This work addresses semantic change detection for remote sensing applications, offering incremental improvements over existing methods.

The paper tackles the problem of semantic change detection in remote sensing, which requires identifying both change locations and detailed 'from-to' categories, by proposing a multi-task framework that addresses negative transfer issues through adaptive weight allocation and gradient rotation. The method achieves state-of-the-art performance on SECOND and Landsat-SCD datasets, with significant improvements in accuracy and robustness.

Semantic change detection (SCD) extends the binary change detection task to provide not only the change locations but also the detailed "from-to" categories in multi-temporal remote sensing data. Such detailed semantic insights into changes offer considerable advantages for a wide array of applications. However, since SCD involves the simultaneous optimization of multiple tasks, the model is prone to negative transfer due to task-specific learning difficulties and conflicting gradient flows. To address this issue, we propose Graph Aggregation Prototype Learning for Semantic Change Detection in remote sensing(GAPL-SCD). In this framework, a multi-task joint optimization method is designed to optimize the primary task of semantic segmentation and change detection, along with the auxiliary task of graph aggregation prototype learning. Adaptive weight allocation and gradient rotation methods are used to alleviate the conflict between training tasks and improve multi-task learning capabilities. Specifically, the graph aggregation prototype learning module constructs an interaction graph using high-level features. Prototypes serve as class proxies, enabling category-level domain alignment across time points and reducing interference from irrelevant changes. Additionally, the proposed self-query multi-level feature interaction and bi-temporal feature fusion modules further enhance multi-scale feature representation, improving performance in complex scenes. Experimental results on the SECOND and Landsat-SCD datasets demonstrate that our method achieves state-of-the-art performance, with significant improvements in accuracy and robustness for SCD task.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes