CVDec 7, 2021

Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection

Shoubin Yu, Zhongyin Zhao, Haoshu Fang, Andong Deng, Haisheng Su, Dongliang Wang, Weihao Gan, Cewu Lu, Wei Wu

arXiv:2112.03649v211.135 citationsh-index: 54Has Code

Originality Incremental advance

AI Analysis

This work addresses public security by improving anomaly detection in surveillance videos, offering a domain-specific incremental advance over existing pose-based methods.

The paper tackles the challenge of anomaly detection in surveillance videos by proposing a pose-based method that learns motion representations and reconstructs pose sequences, achieving state-of-the-art performance with an average 4.7% AUC improvement on multiple datasets.

Anomaly detection in surveillance videos is challenging and important for ensuring public security. Different from pixel-based anomaly detection methods, pose-based methods utilize highly-structured skeleton data, which decreases the computational burden and also avoids the negative impact of background noise. However, unlike pixel-based methods, which could directly exploit explicit motion features such as optical flow, pose-based methods suffer from the lack of alternative dynamic representation. In this paper, a novel Motion Embedder (ME) is proposed to provide a pose motion representation from the probability perspective. Furthermore, a novel task-specific Spatial-Temporal Transformer (STT) is deployed for self-supervised pose sequence reconstruction. These two modules are then integrated into a unified framework for pose regularity learning, which is referred to as Motion Prior Regularity Learner (MoPRL). MoPRL achieves the state-of-the-art performance by an average improvement of 4.7% AUC on several challenging datasets. Extensive experiments validate the versatility of each proposed module.

View on arXiv PDF Code

Similar