CVMar 19, 2020

Temporal Extension Module for Skeleton-Based Action Recognition

arXiv:2003.08951v239 citations
Originality Incremental advance
AI Analysis

This work addresses a bottleneck in action recognition for computer vision applications, but it is incremental as it builds on existing GCN methods.

The paper tackles the problem of optimizing the temporal graph in skeleton-based action recognition by adding connections to neighboring multiple vertices on the inter-frame, and the result is that their module achieves state-of-the-art performance on NTU RGB+D and Kinetics-Skeleton datasets.

We present a module that extends the temporal graph of a graph convolutional network (GCN) for action recognition with a sequence of skeletons. Existing methods attempt to represent a more appropriate spatial graph on an intra-frame, but disregard optimization of the temporal graph on the interframe. Concretely, these methods connect between vertices corresponding only to the same joint on the inter-frame. In this work, we focus on adding connections to neighboring multiple vertices on the inter-frame and extracting additional features based on the extended temporal graph. Our module is a simple yet effective method to extract correlated features of multiple joints in human movement. Moreover, our module aids in further performance improvements, along with other GCN methods that optimize only the spatial graph. We conduct extensive experiments on two large datasets, NTU RGB+D and Kinetics-Skeleton, and demonstrate that our module is effective for several existing models and our final model achieves state-of-the-art performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes