CVJul 29, 2021

Temporal Feature Warping for Video Shadow Detection

arXiv:2107.14287v110 citations
Originality Incremental advance
AI Analysis

This addresses the problem of detecting shadows in videos for computer vision applications, offering a significant but incremental improvement over existing methods.

The paper tackles video shadow detection by proposing a temporal feature warping method using optical flow to align and combine features across frames, achieving a 28% improvement over the state-of-the-art by reducing BER from 16.7 to 12.0 on the ViSha dataset.

While single image shadow detection has been improving rapidly in recent years, video shadow detection remains a challenging task due to data scarcity and the difficulty in modelling temporal consistency. The current video shadow detection method achieves this goal via co-attention, which mostly exploits information that is temporally coherent but is not robust in detecting moving shadows and small shadow regions. In this paper, we propose a simple but powerful method to better aggregate information temporally. We use an optical flow based warping module to align and then combine features between frames. We apply this warping module across multiple deep-network layers to retrieve information from neighboring frames including both local details and high-level semantic information. We train and test our framework on the ViSha dataset. Experimental results show that our model outperforms the state-of-the-art video shadow detection method by 28%, reducing BER from 16.7 to 12.0.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes