CVApr 28, 2024

Enhancing Action Recognition from Low-Quality Skeleton Data via Part-Level Knowledge Distillation

arXiv:2404.18206v113 citationsh-index: 14Signal Processing
Originality Incremental advance
AI Analysis

This work addresses a challenge in skeleton-based action recognition for applications in human-centric video analysis, presenting an incremental improvement through a novel distillation approach.

The paper tackles the problem of improving action recognition from low-quality skeleton data with missing or inaccurate joints by proposing a knowledge distillation framework, achieving enhanced performance on datasets like NTU-RGB+D, Penn Action, and SYSU 3D HOI.

Skeleton-based action recognition is vital for comprehending human-centric videos and has applications in diverse domains. One of the challenges of skeleton-based action recognition is dealing with low-quality data, such as skeletons that have missing or inaccurate joints. This paper addresses the issue of enhancing action recognition using low-quality skeletons through a general knowledge distillation framework. The proposed framework employs a teacher-student model setup, where a teacher model trained on high-quality skeletons guides the learning of a student model that handles low-quality skeletons. To bridge the gap between heterogeneous high-quality and lowquality skeletons, we present a novel part-based skeleton matching strategy, which exploits shared body parts to facilitate local action pattern learning. An action-specific part matrix is developed to emphasize critical parts for different actions, enabling the student model to distill discriminative part-level knowledge. A novel part-level multi-sample contrastive loss achieves knowledge transfer from multiple high-quality skeletons to low-quality ones, which enables the proposed knowledge distillation framework to include training low-quality skeletons that lack corresponding high-quality matches. Comprehensive experiments conducted on the NTU-RGB+D, Penn Action, and SYSU 3D HOI datasets demonstrate the effectiveness of the proposed knowledge distillation framework.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes