CVAIROIVDec 11, 2023

Navigating Open Set Scenarios for Skeleton-based Action Recognition

arXiv:2312.06330v132 citationsh-index: 39Has CodeAAAI
Originality Incremental advance
AI Analysis

It addresses the challenge of recognizing known actions and rejecting unknown ones in real-world scenarios using skeleton data, which is incremental as it builds on existing open-set methods with a novel adaptation for skeleton-based recognition.

The paper tackles the Open-Set Skeleton-based Action Recognition (OS-SAR) problem by formalizing a benchmark on three datasets and proposing CrossMax, a distance-based cross-modality ensemble method that outperforms existing approaches, achieving state-of-the-art results across all datasets and backbones.

In real-world scenarios, human actions often fall outside the distribution of training data, making it crucial for models to recognize known actions and reject unknown ones. However, using pure skeleton data in such open-set conditions poses challenges due to the lack of visual background cues and the distinct sparse structure of body pose sequences. In this paper, we tackle the unexplored Open-Set Skeleton-based Action Recognition (OS-SAR) task and formalize the benchmark on three skeleton-based datasets. We assess the performance of seven established open-set approaches on our task and identify their limits and critical generalization issues when dealing with skeleton information. To address these challenges, we propose a distance-based cross-modality ensemble method that leverages the cross-modal alignment of skeleton joints, bones, and velocities to achieve superior open-set recognition performance. We refer to the key idea as CrossMax - an approach that utilizes a novel cross-modality mean max discrepancy suppression mechanism to align latent spaces during training and a cross-modality distance-based logits refinement method during testing. CrossMax outperforms existing approaches and consistently yields state-of-the-art results across all datasets and backbones. The benchmark, code, and models will be released at https://github.com/KPeng9510/OS-SAR.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes