CVAug 7, 2022

Video-based Human Action Recognition using Deep Learning: A Review

arXiv:2208.03775v147 citationsh-index: 44
Originality Synthesis-oriented
AI Analysis

It provides a comprehensive overview for researchers and practitioners in computer vision, but is incremental as it synthesizes existing work without new methods or data.

This review paper surveys the current state-of-the-art in video-based human action recognition using deep learning, analyzing models and their performance based on reported recognition accuracies to identify trends and open problems.

Human action recognition is an important application domain in computer vision. Its primary aim is to accurately describe human actions and their interactions from a previously unseen data sequence acquired by sensors. The ability to recognize, understand, and predict complex human actions enables the construction of many important applications such as intelligent surveillance systems, human-computer interfaces, health care, security, and military applications. In recent years, deep learning has been given particular attention by the computer vision community. This paper presents an overview of the current state-of-the-art in action recognition using video analysis with deep learning techniques. We present the most important deep learning models for recognizing human actions, and analyze them to provide the current progress of deep learning algorithms applied to solve human action recognition problems in realistic videos highlighting their advantages and disadvantages. Based on the quantitative analysis using recognition accuracies reported in the literature, our study identifies state-of-the-art deep architectures in action recognition and then provides current trends and open problems for future works in this field.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes