CVDec 7, 2021

Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection

arXiv:2112.03649v235 citations
Originality Incremental advance
AI Analysis

This work addresses public security by improving anomaly detection in surveillance videos, offering a domain-specific incremental advance over existing pose-based methods.

The paper tackles the challenge of anomaly detection in surveillance videos by proposing a pose-based method that learns motion representations and reconstructs pose sequences, achieving state-of-the-art performance with an average 4.7% AUC improvement on multiple datasets.

Anomaly detection in surveillance videos is challenging and important for ensuring public security. Different from pixel-based anomaly detection methods, pose-based methods utilize highly-structured skeleton data, which decreases the computational burden and also avoids the negative impact of background noise. However, unlike pixel-based methods, which could directly exploit explicit motion features such as optical flow, pose-based methods suffer from the lack of alternative dynamic representation. In this paper, a novel Motion Embedder (ME) is proposed to provide a pose motion representation from the probability perspective. Furthermore, a novel task-specific Spatial-Temporal Transformer (STT) is deployed for self-supervised pose sequence reconstruction. These two modules are then integrated into a unified framework for pose regularity learning, which is referred to as Motion Prior Regularity Learner (MoPRL). MoPRL achieves the state-of-the-art performance by an average improvement of 4.7% AUC on several challenging datasets. Extensive experiments validate the versatility of each proposed module.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes