CVROMay 20

VSCD: Video-based Scene Change Detection in Unaligned Scenes

arXiv:2605.208213.5
Predicted impact top 88% in CV · last 90 daysOriginality Incremental advance
AI Analysis

This work addresses the challenging problem of detecting changes in indoor environments from unaligned videos, which is crucial for long-term autonomous robots.

The paper introduces Video-based Scene Change Detection (VSCD) for unaligned scenes, proposes a large-scale benchmark with over 1.1 million annotated frames, and presents a query-centric multi-reference model that achieves state-of-the-art performance, validated on a mobile robot for surveillance and incremental learning.

Detecting what has changed in an environment is essential for long-term autonomy, yet most change detection settings assume fixed viewpoints, mild misalignment, or only a few changed objects. We introduce Video-based Scene Change Detection (VSCD), which predicts a pixel-wise change mask for each query frame, given a reference and a query RGB video of the same indoor space recorded at different times under unconstrained camera motion. The two videos are not temporally synchronized, and many object instances may appear or disappear. To study this setting, we build a large-scale benchmark with over 1.1 million frames annotated with pixel-accurate change masks, together with a real-world test set for evaluating transfer beyond simulation. We propose a query-centric multi-reference model that learns temporal matching implicitly from change-mask supervision, aligns candidate reference features to the query via local patch correspondence, and fuses per-candidate change features using frame-level and patch-level confidence before decoding a high-resolution mask once per frame. Our approach achieves state-of-the-art performance against strong image- and video-based baselines, and we validate its real-world impact by deploying it on a mobile robot for two downstream applications -- visual surveillance and object incremental learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes