CVJun 30, 2022

Spatial Transformer Network with Transfer Learning for Small-scale Fine-grained Skeleton-based Tai Chi Action Recognition

arXiv:2206.15002v15 citationsh-index: 52
Originality Synthesis-oriented
AI Analysis

This addresses the challenge of fine-grained action recognition in a specific domain (Tai Chi) with limited data, though it is incremental as it adapts existing methods.

The paper tackled the problem of recognizing small-scale fine-grained Tai Chi actions by proposing a transfer learning method using a pre-trained Transformer network on the NTU RGB+D dataset, achieving state-of-the-art performance with high accuracy.

Human action recognition is a quite hugely investigated area where most remarkable action recognition networks usually use large-scale coarse-grained action datasets of daily human actions as inputs to state the superiority of their networks. We intend to recognize our small-scale fine-grained Tai Chi action dataset using neural networks and propose a transfer-learning method using NTU RGB+D dataset to pre-train our network. More specifically, the proposed method first uses a large-scale NTU RGB+D dataset to pre-train the Transformer-based network for action recognition to extract common features among human motion. Then we freeze the network weights except for the fully connected (FC) layer and take our Tai Chi actions as inputs only to train the initialized FC weights. Experimental results show that our general model pipeline can reach a high accuracy of small-scale fine-grained Tai Chi action recognition with even few inputs and demonstrate that our method achieves the state-of-the-art performance compared with previous Tai Chi action recognition methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes