CVNov 20, 2023

SeaDSC: A video-based unsupervised method for dynamic scene change detection in unmanned surface vehicles

arXiv:2311.11580v14 citationsh-index: 6
Originality Synthesis-oriented
AI Analysis

This work addresses scene understanding for USVs, which is crucial for surveillance tasks, but it is incremental as it adapts existing techniques to a new application domain.

The paper tackles dynamic scene change detection in maritime video data for Unmanned Surface Vehicles (USVs) by proposing SeaDSC, an unsupervised method using a modified VQ-VAE-2 model and a novel similarity scoring technique, achieving efficient performance as demonstrated on the RoboWhaler dataset.

Recently, there has been an upsurge in the research on maritime vision, where a lot of works are influenced by the application of computer vision for Unmanned Surface Vehicles (USVs). Various sensor modalities such as camera, radar, and lidar have been used to perform tasks such as object detection, segmentation, object tracking, and motion planning. A large subset of this research is focused on the video analysis, since most of the current vessel fleets contain the camera's onboard for various surveillance tasks. Due to the vast abundance of the video data, video scene change detection is an initial and crucial stage for scene understanding of USVs. This paper outlines our approach to detect dynamic scene changes in USVs. To the best of our understanding, this work represents the first investigation of scene change detection in the maritime vision application. Our objective is to identify significant changes in the dynamic scenes of maritime video data, particularly those scenes that exhibit a high degree of resemblance. In our system for dynamic scene change detection, we propose completely unsupervised learning method. In contrast to earlier studies, we utilize a modified cutting-edge generative picture model called VQ-VAE-2 to train on multiple marine datasets, aiming to enhance the feature extraction. Next, we introduce our innovative similarity scoring technique for directly calculating the level of similarity in a sequence of consecutive frames by utilizing grid calculation on retrieved features. The experiments were conducted using a nautical video dataset called RoboWhaler to showcase the efficient performance of our technique.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes