CVNov 8, 2022

Two-stream Multi-dimensional Convolutional Network for Real-time Violence Detection

arXiv:2211.04255v18 citationsh-index: 18
Originality Highly original
AI Analysis

This work addresses the need for real-time, efficient violence detection in surveillance systems, offering a novel architecture that reduces computational costs while maintaining high accuracy.

The authors tackled the problem of inefficient feature extraction in deep learning models for violence detection by proposing a lightweight Two-stream Multi-dimensional Convolutional Network (2s-MDCN) that uses RGB frames and optical flow, achieving state-of-the-art accuracy of 89.7% on a benchmark dataset with a 2.2% improvement over single-stream methods.

The increasing number of surveillance cameras and security concerns have made automatic violent activity detection from surveillance footage an active area for research. Modern deep learning methods have achieved good accuracy in violence detection and proved to be successful because of their applicability in intelligent surveillance systems. However, the models are computationally expensive and large in size because of their inefficient methods for feature extraction. This work presents a novel architecture for violence detection called Two-stream Multi-dimensional Convolutional Network (2s-MDCN), which uses RGB frames and optical flow to detect violence. Our proposed method extracts temporal and spatial information independently by 1D, 2D, and 3D convolutions. Despite combining multi-dimensional convolutional networks, our models are lightweight and efficient due to reduced channel capacity, yet they learn to extract meaningful spatial and temporal information. Additionally, combining RGB frames and optical flow yields 2.2% more accuracy than a single RGB stream. Regardless of having less complexity, our models obtained state-of-the-art accuracy of 89.7% on the largest violence detection benchmark dataset.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes