CVMar 3

E2E-GNet: An End-to-End Skeleton-based Geometric Deep Neural Network for Human Motion Recognition

arXiv:2603.02477v11.5h-index: 15

Originality Highly original

AI Analysis

This work addresses the problem of human motion recognition for the computer vision community, providing an incremental improvement in the field of geometric deep learning.

The authors tackled the problem of skeleton-based human motion recognition and achieved a higher motion recognition rate by introducing a geometric transformation layer and a distortion-aware optimization layer. E2E-GNet outperformed other methods across five datasets.

Geometric deep learning has recently gained significant attention in the computer vision community for its ability to capture meaningful representations of data lying in a non-Euclidean space. To this end, we propose E2E-GNet, an end-to-end geometric deep neural network for skeleton-based human motion recognition. To enhance the discriminative power between different motions in the non-Euclidean space, E2E-GNet introduces a geometric transformation layer that jointly optimizes skeleton motion sequences on this space and applies a differentiable logarithm map activation to project them onto a linear space. Building on this, we further design a distortion-aware optimization layer that limits skeleton shape distortions caused by this projection, enabling the network to retain discriminative geometric cues and achieve a higher motion recognition rate. We demonstrate the impact of each layer through ablation studies and extensive experiments across five datasets spanning three domains show that E2E-GNet outperforms other methods with lower cost.

View on arXiv PDF

Similar