CVApr 21, 2022

Unsupervised Human Action Recognition with Skeletal Graph Laplacian and Self-Supervised Viewpoints Invariance

arXiv:2204.10312v122 citationsh-index: 35Has Code
AI Analysis

This addresses the problem of recognizing human actions from skeleton data without labeled supervision, which is incremental as it builds on prior unsupervised approaches with specific improvements.

The paper tackles skeleton-based unsupervised human action recognition by proposing a method with graph Laplacian regularization and self-supervised viewpoint invariance, achieving state-of-the-art results on NTU-60 and NTU-120 datasets and even surpassing some supervised methods.

This paper presents a novel end-to-end method for the problem of skeleton-based unsupervised human action recognition. We propose a new architecture with a convolutional autoencoder that uses graph Laplacian regularization to model the skeletal geometry across the temporal dynamics of actions. Our approach is robust towards viewpoint variations by including a self-supervised gradient reverse layer that ensures generalization across camera views. The proposed method is validated on NTU-60 and NTU-120 large-scale datasets in which it outperforms all prior unsupervised skeleton-based approaches on the cross-subject, cross-view, and cross-setup protocols. Although unsupervised, our learnable representation allows our method even to surpass a few supervised skeleton-based action recognition methods. The code is available in: www.github.com/IIT-PAVIS/UHAR_Skeletal_Laplacian

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes