CVMar 17

Fast-HaMeR: Boosting Hand Mesh Reconstruction using Knowledge Distillation

arXiv:2603.1644460.7h-index: 17Has Code
Predicted impact top 75% in CV · last 90 daysOriginality Incremental advance
AI Analysis

This enables efficient real-time hand reconstruction for VR/AR and other applications on resource-constrained devices, but it is incremental as it builds on existing methods.

The paper tackles the problem of slow and heavy 3D hand reconstruction models by using lightweight neural networks and knowledge distillation to accelerate the HaMeR model, achieving 1.5x faster inference speed with only a 0.4mm accuracy drop.

Fast and accurate 3D hand reconstruction is essential for real-time applications in VR/AR, human-computer interaction, robotics, and healthcare. Most state-of-the-art methods rely on heavy models, limiting their use on resource-constrained devices like headsets, smartphones, and embedded systems. In this paper, we investigate how the use of lightweight neural networks, combined with Knowledge Distillation, can accelerate complex 3D hand reconstruction models by making them faster and lighter, while maintaining comparable reconstruction accuracy. While our approach is suited for various hand reconstruction frameworks, we focus primarily on boosting the HaMeR model, currently the leading method in terms of reconstruction accuracy. We replace its original ViT-H backbone with lighter alternatives, including MobileNet, MobileViT, ConvNeXt, and ResNet, and evaluate three knowledge distillation strategies: output-level, feature-level, and a hybrid of both. Our experiments show that using lightweight backbones that are only 35% the size of the original achieves 1.5x faster inference speed while preserving similar performance quality with only a minimal accuracy difference of 0.4mm. More specifically, we show how output-level distillation notably improves student performance, while feature-level distillation proves more effective for higher-capacity students. Overall, the findings pave the way for efficient real-world applications on low-power devices. The code and models are publicly available under https://github.com/hunainahmedj/Fast-HaMeR.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes