CVRODec 10, 2025

CS3D: An Efficient Facial Expression Recognition via Event Vision

arXiv:2512.09592v11 citationsh-index: 1ROBIO
Originality Incremental advance
AI Analysis

This work addresses energy efficiency for facial expression recognition in human-robot interaction, offering an incremental improvement for edge computing applications.

The paper tackles the challenge of deploying deep learning models for facial expression recognition on edge devices by proposing the CS3D framework, which reduces computational complexity and energy consumption while achieving higher accuracy on multiple datasets, with energy consumption at 21.97% of the original C3D method.

Responsive and accurate facial expression recognition is crucial to human-robot interaction for daily service robots. Nowadays, event cameras are becoming more widely adopted as they surpass RGB cameras in capturing facial expression changes due to their high temporal resolution, low latency, computational efficiency, and robustness in low-light conditions. Despite these advantages, event-based approaches still encounter practical challenges, particularly in adopting mainstream deep learning models. Traditional deep learning methods for facial expression analysis are energy-intensive, making them difficult to deploy on edge computing devices and thereby increasing costs, especially for high-frequency, dynamic, event vision-based approaches. To address this challenging issue, we proposed the CS3D framework by decomposing the Convolutional 3D method to reduce the computational complexity and energy consumption. Additionally, by utilizing soft spiking neurons and a spatial-temporal attention mechanism, the ability to retain information is enhanced, thus improving the accuracy of facial expression detection. Experimental results indicate that our proposed CS3D method attains higher accuracy on multiple datasets compared to architectures such as the RNN, Transformer, and C3D, while the energy consumption of the CS3D method is just 21.97\% of the original C3D required on the same device.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes