CVFeb 9, 2025

HyLiFormer: Hyperbolic Linear Attention for Skeleton-based Human Action Recognition

arXiv:2502.05869v15 citationsh-index: 9
Originality Highly original
AI Analysis

This work addresses the efficiency problem for applications of skeleton-based human action recognition, which is incremental as it improves upon existing Transformer-based approaches.

The authors tackled the problem of quadratic computational complexity in Transformers for skeleton-based human action recognition, and their proposed HyLiFormer model reduces computational complexity while preserving model accuracy. The model achieves this on NTU RGB+D and NTU RGB+D 120 datasets.

Transformers have demonstrated remarkable performance in skeleton-based human action recognition, yet their quadratic computational complexity remains a bottleneck for real-world applications. To mitigate this, linear attention mechanisms have been explored but struggle to capture the hierarchical structure of skeleton data. Meanwhile, the Poincaré model, as a typical hyperbolic geometry, offers a powerful framework for modeling hierarchical structures but lacks well-defined operations for existing mainstream linear attention. In this paper, we propose HyLiFormer, a novel hyperbolic linear attention Transformer tailored for skeleton-based action recognition. Our approach incorporates a Hyperbolic Transformation with Curvatures (HTC) module to map skeleton data into hyperbolic space and a Hyperbolic Linear Attention (HLA) module for efficient long-range dependency modeling. Theoretical analysis and extensive experiments on NTU RGB+D and NTU RGB+D 120 datasets demonstrate that HyLiFormer significantly reduces computational complexity while preserving model accuracy, making it a promising solution for efficiency-critical applications.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes