CV | LG | Dec 31, 2024

Spatio-Temporal Multi-Subgraph GCN for 3D Human Motion Prediction

arXiv:2501.00317v1 | 5 citations | h-index: 4 | ICASSP
Originality: Incremental advance
AI Analysis

This work addresses a domain-specific problem in computer vision for applications like animation or robotics, with incremental improvements over existing GCN-based methods.

The paper tackles 3D human motion prediction by proposing STMS-GCN, which captures complex spatio-temporal dependencies, and reports superior performance on standard benchmarks.

Human motion prediction (HMP) involves forecasting future human motion based on historical data. Graph Convolutional Networks (GCNs) have garnered widespread attention in this field for their proficiency in capturing relationships among joints in human motion. However, existing GCN-based methods tend to focus on either temporal-domain or spatial-domain features, or they combine spatio-temporal features without fully leveraging the complementarity and cross-dependency of these two features. In this paper, we propose the Spatial-Temporal Multi-Subgraph Graph Convolutional Network (STMS-GCN) to capture complex spatio-temporal dependencies in human motion. Specifically, we decouple the modeling of temporal and spatial dependencies, enabling cross-domain knowledge transfer at multiple scales through a spatio-temporal information consistency constraint mechanism. Besides, we utilize multiple subgraphs to extract richer motion information and enhance the learning associations of diverse subgraphs through a homogeneous information constraint mechanism. Extensive experiments on the standard HMP benchmarks demonstrate the superiority of our method.
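The abstract names three ingredients: decoupled spatial and temporal graph convolutions, multiple subgraphs per domain, and two constraint terms. The following is a minimal NumPy sketch of how such pieces could fit together, not the authors' implementation. The abstract does not specify the layer definitions or loss forms, so the function names, the random adjacency matrices, the averaging of subgraph outputs, and the mean-squared-error form of both constraints are all assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy sizes (assumed for illustration): frames, joints, input channels, hidden channels, subgraphs.
T, J, C, D, K = 10, 22, 3, 16, 3

def spatial_gcn(x, adj, weight):
    """Graph convolution over joints. x: (T, J, C), adj: (J, J), weight: (C, D)."""
    return np.tanh(np.einsum("ij,tjc,cd->tid", adj, x, weight))

def temporal_gcn(x, adj, weight):
    """Graph convolution over frames. x: (T, J, C), adj: (T, T), weight: (C, D)."""
    return np.tanh(np.einsum("st,tjc,cd->sjd", adj, x, weight))

def multi_subgraph(x, adjs, weights, conv):
    """Apply one graph convolution per subgraph; return all outputs and their mean."""
    outs = [conv(x, a, w) for a, w in zip(adjs, weights)]
    return outs, np.mean(outs, axis=0)

def consistency_loss(a, b):
    """Assumed form of a spatio-temporal consistency term: MSE between branch features."""
    return float(np.mean((a - b) ** 2))

def homogeneity_loss(outs):
    """Assumed form of a homogeneous-information term: mean pairwise MSE across subgraphs."""
    pairs = [(i, j) for i in range(len(outs)) for j in range(i + 1, len(outs))]
    return float(np.mean([np.mean((outs[i] - outs[j]) ** 2) for i, j in pairs]))

# Random stand-ins for a motion sequence and for learnable parameters.
x = rng.standard_normal((T, J, C))
spatial_adjs = [rng.standard_normal((J, J)) * 0.1 for _ in range(K)]
temporal_adjs = [rng.standard_normal((T, T)) * 0.1 for _ in range(K)]
spatial_ws = [rng.standard_normal((C, D)) * 0.1 for _ in range(K)]
temporal_ws = [rng.standard_normal((C, D)) * 0.1 for _ in range(K)]

# Decoupled branches: each domain is modeled separately with K subgraphs.
spatial_outs, spatial_feat = multi_subgraph(x, spatial_adjs, spatial_ws, spatial_gcn)
temporal_outs, temporal_feat = multi_subgraph(x, temporal_adjs, temporal_ws, temporal_gcn)

# Constraint terms that would be added to the prediction loss during training.
l_st = consistency_loss(spatial_feat, temporal_feat)
l_homo = homogeneity_loss(spatial_outs) + homogeneity_loss(temporal_outs)
print(spatial_feat.shape, temporal_feat.shape, l_st, l_homo)
```

Both branches produce features of the same shape, which is what makes a direct feature-level consistency term possible in this sketch. The paper applies its constraint at multiple scales, which this single-scale example omits.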

