CV AI IVAug 20, 2024

Hierarchical Attention Diffusion Networks with Object Priors for Video Change Detection

Andrew Kiruluta, Eric Lundy, Andreas Lemos

arXiv:2408.10619v21 citationsh-index: 3

Originality Highly original

AI Analysis

This provides a new state-of-the-art solution for remote sensing change detection, offering detailed multi-class maps.

The paper tackles video change detection by combining instance masking, multi-scale attention in a diffusion model, and semantic classification, achieving 10-25 point improvements in F1 and IoU over existing methods.

We present a unified change detection pipeline that combines instance level masking, multi\-scale attention within a denoising diffusion model, and per pixel semantic classification, all refined via SSIM to match human perception. By first isolating only temporally novel objects with Mask R\-CNN, then guiding diffusion updates through hierarchical cross attention to object and global contexts, and finally categorizing each pixel into one of C change types, our method delivers detailed, interpretable multi\-class maps. It outperforms traditional differencing, Siamese CNNs, and GAN\-based detectors by 10\-25 points in F1 and IoU on both synthetic and real world benchmarks, marking a new state of the art in remote sensing change detection.

View on arXiv PDF

Similar