CVAISep 13, 2018

Part-based Graph Convolutional Network for Action Recognition

arXiv:1809.04983v1180 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of improving action recognition accuracy for applications like surveillance or human-computer interaction, representing an incremental advance by adapting part-based models to graph networks.

The paper tackles action recognition from skeletal videos by introducing a part-based graph convolutional network (PB-GCN) that divides the skeleton into subgraphs and uses relative coordinates and temporal displacements as features, achieving state-of-the-art performance on NTURGB+D and HDM05 datasets.

Human actions comprise of joint motion of articulated body parts or `gestures'. Human skeleton is intuitively represented as a sparse graph with joints as nodes and natural connections between them as edges. Graph convolutional networks have been used to recognize actions from skeletal videos. We introduce a part-based graph convolutional network (PB-GCN) for this task, inspired by Deformable Part-based Models (DPMs). We divide the skeleton graph into four subgraphs with joints shared across them and learn a recognition model using a part-based graph convolutional network. We show that such a model improves performance of recognition, compared to a model using entire skeleton graph. Instead of using 3D joint coordinates as node features, we show that using relative coordinates and temporal displacements boosts performance. Our model achieves state-of-the-art performance on two challenging benchmark datasets NTURGB+D and HDM05, for skeletal action recognition.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes