CVApr 21, 2022

Unsupervised Human Action Recognition with Skeletal Graph Laplacian and Self-Supervised Viewpoints Invariance

Giancarlo Paoletti, Jacopo Cavazza, Cigdem Beyan, Alessio Del Bue

arXiv:2204.10312v19.422 citationsh-index: 35Has Code

Originality Highly original

AI Analysis

This addresses the problem of recognizing human actions from skeleton data without labeled supervision, which is incremental as it builds on prior unsupervised approaches with specific improvements.

The paper tackles skeleton-based unsupervised human action recognition by proposing a method with graph Laplacian regularization and self-supervised viewpoint invariance, achieving state-of-the-art results on NTU-60 and NTU-120 datasets and even surpassing some supervised methods.

This paper presents a novel end-to-end method for the problem of skeleton-based unsupervised human action recognition. We propose a new architecture with a convolutional autoencoder that uses graph Laplacian regularization to model the skeletal geometry across the temporal dynamics of actions. Our approach is robust towards viewpoint variations by including a self-supervised gradient reverse layer that ensures generalization across camera views. The proposed method is validated on NTU-60 and NTU-120 large-scale datasets in which it outperforms all prior unsupervised skeleton-based approaches on the cross-subject, cross-view, and cross-setup protocols. Although unsupervised, our learnable representation allows our method even to surpass a few supervised skeleton-based action recognition methods. The code is available in: www.github.com/IIT-PAVIS/UHAR_Skeletal_Laplacian

View on arXiv PDF Code

Similar