CVAug 11, 2022

Locality-aware Attention Network with Discriminative Dynamics Learning for Weakly Supervised Anomaly Detection

arXiv:2208.05636v126 citationsh-index: 5
Originality Incremental advance
AI Analysis

This addresses video anomaly detection for security and surveillance, but it is incremental as it builds on existing weakly supervised methods by adding dynamics modeling.

The paper tackles weakly supervised video anomaly detection by proposing a Discriminative Dynamics Learning method to model temporal dynamics between snippets, achieving significant improvements on benchmarks like UCF-Crime and XD-Violence.

Video anomaly detection is recently formulated as a multiple instance learning task under weak supervision, in which each video is treated as a bag of snippets to be determined whether contains anomalies. Previous efforts mainly focus on the discrimination of the snippet itself without modeling the temporal dynamics, which refers to the variation of adjacent snippets. Therefore, we propose a Discriminative Dynamics Learning (DDL) method with two objective functions, i.e., dynamics ranking loss and dynamics alignment loss. The former aims to enlarge the score dynamics gap between positive and negative bags while the latter performs temporal alignment of the feature dynamics and score dynamics within the bag. Moreover, a Locality-aware Attention Network (LA-Net) is constructed to capture global correlations and re-calibrate the location preference across snippets, followed by a multilayer perceptron with causal convolution to obtain anomaly scores. Experimental results show that our method achieves significant improvements on two challenging benchmarks, i.e., UCF-Crime and XD-Violence.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes