Bioformers: Embedding Transformers for Ultra-Low Power sEMG-based Gesture Recognition
This work addresses energy-efficient gesture recognition for rehabilitation devices like prosthetic hands, offering incremental improvements in model efficiency and accuracy.
The paper tackles the challenge of low classification accuracy and high energy consumption in sEMG-based gesture recognition by introducing Bioformers, a family of ultra-small attention-based architectures that reduce parameters and operations by 4.9X, achieve state-of-the-art accuracy with a 3.39% improvement via inter-subjects pre-training, and deploy on a microcontroller with 2.72 ms latency, 0.14 mJ energy, and 94.2 kB memory, 8.0X lower than previous state-of-the-art.
Human-machine interaction is gaining traction in rehabilitation tasks, such as controlling prosthetic hands or robotic arms. Gesture recognition exploiting surface electromyographic (sEMG) signals is one of the most promising approaches, given that sEMG signal acquisition is non-invasive and is directly related to muscle contraction. However, the analysis of these signals still presents many challenges since similar gestures result in similar muscle contractions. Thus the resulting signal shapes are almost identical, leading to low classification accuracy. To tackle this challenge, complex neural networks are employed, which require large memory footprints, consume relatively high energy and limit the maximum battery life of devices used for classification. This work addresses this problem with the introduction of the Bioformers. This new family of ultra-small attention-based architectures approaches state-of-the-art performance while reducing the number of parameters and operations of 4.9X. Additionally, by introducing a new inter-subjects pre-training, we improve the accuracy of our best Bioformer by 3.39%, matching state-of-the-art accuracy without any additional inference cost. Deploying our best performing Bioformer on a Parallel, Ultra-Low Power (PULP) microcontroller unit (MCU), the GreenWaves GAP8, we achieve an inference latency and energy of 2.72 ms and 0.14 mJ, respectively, 8.0X lower than the previous state-of-the-art neural network, while occupying just 94.2 kB of memory.