CVOct 31, 2024

Recovering Complete Actions for Cross-dataset Skeleton Action Recognition

arXiv:2410.23641v15 citationsh-index: 20NIPS
Originality Incremental advance
AI Analysis

This addresses the domain generalization issue in skeleton action recognition, which is crucial for real-world applications, though it appears incremental as it builds on existing augmentation and prior knowledge methods.

The paper tackles the problem of skeleton-based action recognition generalizing poorly across datasets by proposing a recover-and-resample augmentation framework based on a complete action prior, achieving state-of-the-art results in cross-dataset settings with three datasets.

Despite huge progress in skeleton-based action recognition, its generalizability to different domains remains a challenging issue. In this paper, to solve the skeleton action generalization problem, we present a recover-and-resample augmentation framework based on a novel complete action prior. We observe that human daily actions are confronted with temporal mismatch across different datasets, as they are usually partial observations of their complete action sequences. By recovering complete actions and resampling from these full sequences, we can generate strong augmentations for unseen domains. At the same time, we discover the nature of general action completeness within large datasets, indicated by the per-frame diversity over time. This allows us to exploit two assets of transferable knowledge that can be shared across action samples and be helpful for action completion: boundary poses for determining the action start, and linear temporal transforms for capturing global action patterns. Therefore, we formulate the recovering stage as a two-step stochastic action completion with boundary pose-conditioned extrapolation followed by smooth linear transforms. Both the boundary poses and linear transforms can be efficiently learned from the whole dataset via clustering. We validate our approach on a cross-dataset setting with three skeleton action datasets, outperforming other domain generalization approaches by a considerable margin.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes