CVMay 26, 2023

Detect Any Shadow: Segment Anything for Video Shadow Detection

arXiv:2305.16698v238 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses the challenge of accurate and efficient shadow detection in videos, which is important for applications like video editing and surveillance, but it is incremental as it builds upon existing SAM and video processing techniques.

The paper tackles the problem of shadow detection in videos by fine-tuning the Segment Anything Model (SAM) to detect shadows and extending it with a long short-term attention mechanism for video processing, achieving state-of-the-art results with improved MAE and IoU metrics and faster inference speed.

Segment anything model (SAM) has achieved great success in the field of natural image segmentation. Nevertheless, SAM tends to consider shadows as background and therefore does not perform segmentation on them. In this paper, we propose ShadowSAM, a simple yet effective framework for fine-tuning SAM to detect shadows. Besides, by combining it with long short-term attention mechanism, we extend its capability for efficient video shadow detection. Specifically, we first fine-tune SAM on ViSha training dataset by utilizing the bounding boxes obtained from the ground truth shadow mask. Then during the inference stage, we simulate user interaction by providing bounding boxes to detect a specific frame (e.g., the first frame). Subsequently, using the detected shadow mask as a prior, we employ a long short-term network to learn spatial correlations between distant frames and temporal consistency between adjacent frames, thereby achieving precise shadow information propagation across video frames. Extensive experimental results demonstrate the effectiveness of our method, with notable margin over the state-of-the-art approaches in terms of MAE and IoU metrics. Moreover, our method exhibits accelerated inference speed compared to previous video shadow detection approaches, validating the effectiveness and efficiency of our method. The source code is now publicly available at https://github.com/harrytea/Detect-AnyShadow.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes