CVLGROIVMar 1, 2021

DR-TANet: Dynamic Receptive Temporal Attention Network for Street Scene Change Detection

arXiv:2103.00879v256 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of efficiently detecting changes in street scenes for applications such as autonomous vehicles, representing an incremental improvement over existing methods.

The paper tackled street scene change detection by proposing a dynamic receptive temporal attention network, achieving new state-of-the-art scores on datasets like GSV, TSUNAMI, and VL-CMU-CD while maintaining high efficiency for autonomous vehicles.

Street scene change detection continues to capture researchers' interests in the computer vision community. It aims to identify the changed regions of the paired street-view images captured at different times. The state-of-the-art network based on the encoder-decoder architecture leverages the feature maps at the corresponding level between two channels to gain sufficient information of changes. Still, the efficiency of feature extraction, feature correlation calculation, even the whole network requires further improvement. This paper proposes the temporal attention and explores the impact of the dependency-scope size of temporal attention on the performance of change detection. In addition, based on the Temporal Attention Module (TAM), we introduce a more efficient and light-weight version - Dynamic Receptive Temporal Attention Module (DRTAM) and propose the Concurrent Horizontal and Vertical Attention (CHVA) to improve the accuracy of the network on specific challenging entities. On street scene datasets `GSV', `TSUNAMI' and `VL-CMU-CD', our approach gains excellent performance, establishing new state-of-the-art scores without bells and whistles, while maintaining high efficiency applicable in autonomous vehicles.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes