CVApr 7, 2023

Graph-Guided MLP-Mixer for Skeleton-Based Human Motion Prediction

arXiv:2304.03532v24 citationsh-index: 28
Originality Incremental advance
AI Analysis

This addresses the limitation of MLP-Mixer in skeleton-based motion prediction for applications like animation or robotics, but it is incremental as it builds on existing MLP-Mixer and GCN methods.

The paper tackles the problem of human motion prediction by proposing Graph-Guided Mixer, which enhances MLP-Mixer to model graph structure, achieving state-of-the-art performance on datasets like Human3.6M, AMASS, and 3DPW.

In recent years, Graph Convolutional Networks (GCNs) have been widely used in human motion prediction, but their performance remains unsatisfactory. Recently, MLP-Mixer, initially developed for vision tasks, has been leveraged into human motion prediction as a promising alternative to GCNs, which achieves both better performance and better efficiency than GCNs. Unlike GCNs, which can explicitly capture human skeleton's bone-joint structure by representing it as a graph with edges and nodes, MLP-Mixer relies on fully connected layers and thus cannot explicitly model such graph-like structure of human's. To break this limitation of MLP-Mixer's, we propose \textit{Graph-Guided Mixer}, a novel approach that equips the original MLP-Mixer architecture with the capability to model graph structure. By incorporating graph guidance, our \textit{Graph-Guided Mixer} can effectively capture and utilize the specific connectivity patterns within human skeleton's graph representation. In this paper, first we uncover a theoretical connection between MLP-Mixer and GCN that is unexplored in existing research. Building on this theoretical connection, next we present our proposed \textit{Graph-Guided Mixer}, explaining how the original MLP-Mixer architecture is reinvented to incorporate guidance from graph structure. Then we conduct an extensive evaluation on the Human3.6M, AMASS, and 3DPW datasets, which shows that our method achieves state-of-the-art performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes