CVAILGMar 27, 2023

Prompt-Guided Zero-Shot Anomaly Action Recognition using Pretrained Deep Skeleton Features

arXiv:2303.15167v147 citationsh-index: 4
Originality Incremental advance
AI Analysis

This work solves the problem of detecting abnormal human behavior in videos without labeled anomalies, which is incremental as it builds on existing skeleton-based approaches.

The study tackled unsupervised anomaly action recognition in videos by addressing three limitations of skeleton-based methods, achieving state-of-the-art performance with an AUC of 0.95 on one dataset and 0.92 on another.

This study investigates unsupervised anomaly action recognition, which identifies video-level abnormal-human-behavior events in an unsupervised manner without abnormal samples, and simultaneously addresses three limitations in the conventional skeleton-based approaches: target domain-dependent DNN training, robustness against skeleton errors, and a lack of normal samples. We present a unified, user prompt-guided zero-shot learning framework using a target domain-independent skeleton feature extractor, which is pretrained on a large-scale action recognition dataset. Particularly, during the training phase using normal samples, the method models the distribution of skeleton features of the normal actions while freezing the weights of the DNNs and estimates the anomaly score using this distribution in the inference phase. Additionally, to increase robustness against skeleton errors, we introduce a DNN architecture inspired by a point cloud deep learning paradigm, which sparsely propagates the features between joints. Furthermore, to prevent the unobserved normal actions from being misidentified as abnormal actions, we incorporate a similarity score between the user prompt embeddings and skeleton features aligned in the common space into the anomaly score, which indirectly supplements normal actions. On two publicly available datasets, we conduct experiments to test the effectiveness of the proposed method with respect to abovementioned limitations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes