CVApr 17

MMGait: Towards Multi-Modal Gait Recognition

Chenye Wang, Qingyuan Cai, Saihui Hou, Aoqi Li, Yongzhen Huang

arXiv:2604.1597963.7h-index: 23Has Code

AI Analysis

This work provides a comprehensive benchmark and baseline for multi-modal gait recognition, enabling systematic study of modality robustness and complementarity for the biometrics community.

The paper introduces MMGait, a large-scale multi-modal gait recognition benchmark with 12 modalities from 5 sensors, and proposes OmniGait, a baseline that learns a shared embedding space across modalities, achieving promising recognition performance.

Gait recognition has emerged as a powerful biometric technique for identifying individuals at a distance without requiring user cooperation. Most existing methods focus primarily on RGB-derived modalities, which fall short in real-world scenarios requiring multi-modal collaboration and cross-modal retrieval. To overcome these challenges, we present MMGait, a comprehensive multi-modal gait benchmark integrating data from five heterogeneous sensors, including an RGB camera, a depth camera, an infrared camera, a LiDAR scanner, and a 4D Radar system. MMGait contains twelve modalities and 334,060 sequences from 725 subjects, enabling systematic exploration across geometric, photometric, and motion domains. Based on MMGait, we conduct extensive evaluations on single-modal, cross-modal, and multi-modal paradigms to analyze modality robustness and complementarity. Furthermore, we introduce a new task, Omni Multi-Modal Gait Recognition, which aims to unify the above three gait recognition paradigms within a single model. We also propose a simple yet powerful baseline, OmniGait, which learns a shared embedding space across diverse modalities and achieves promising recognition performance. The MMGait benchmark, codebase, and pretrained checkpoints are publicly available at https://github.com/BNU-IVC/MMGait.

View on arXiv PDF Code

Similar