CVLGApr 19, 2019

Simple yet efficient real-time pose-based action recognition

arXiv:1904.09140v154 citations
Originality Incremental advance
AI Analysis

This addresses the need for autonomous systems, such as in autonomous driving, to recognize human actions in real-time, though it appears incremental by building on existing pose estimation and classification methods.

The paper tackles real-time human action recognition by proposing a pipeline that encodes human pose into a new format called Encoded Human Pose Image (EHPI) for classification, achieving competitive state-of-the-art performance in pose-based action detection while ensuring real-time operation.

Recognizing human actions is a core challenge for autonomous systems as they directly share the same space with humans. Systems must be able to recognize and assess human actions in real-time. In order to train corresponding data-driven algorithms, a significant amount of annotated training data is required. We demonstrated a pipeline to detect humans, estimate their pose, track them over time and recognize their actions in real-time with standard monocular camera sensors. For action recognition, we encode the human pose into a new data format called Encoded Human Pose Image (EHPI) that can then be classified using standard methods from the computer vision community. With this simple procedure we achieve competitive state-of-the-art performance in pose-based action detection and can ensure real-time performance. In addition, we show a use case in the context of autonomous driving to demonstrate how such a system can be trained to recognize human actions using simulation data.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes