CVApr 6, 2021

Non-contact PPG Signal and Heart Rate Estimation with Multi-hierarchical Convolutional Network

arXiv:2104.02260v252 citations
AI Analysis

This work addresses remote physiological monitoring for healthcare applications, presenting an incremental improvement over existing methods.

The study tackled non-contact heart rate estimation from face videos by proposing a multi-hierarchical convolutional network, achieving state-of-the-art results with mean absolute errors of 2.15, 5.57, and 1.75 bpm on three datasets.

Heartbeat rhythm and heart rate (HR) are important physiological parameters of the human body. This study presents an efficient multi-hierarchical spatio-temporal convolutional network that can quickly estimate remote physiological (rPPG) signal and HR from face video clips. First, the facial color distribution characteristics are extracted using a low-level face feature generation (LFFG) module. Then, the three-dimensional (3D) spatio-temporal stack convolution module (STSC) and multi-hierarchical feature fusion module (MHFF) are used to strengthen the spatio-temporal correlation of multi-channel features. In the MHFF, sparse optical flow is used to capture the tiny motion information of faces between frames and generate a self-adaptive region of interest (ROI) skin mask. Finally, the signal prediction module (SP) is used to extract the estimated rPPG signal. The heart rate estimation results show that the proposed network overperforms the state-of-the-art methods on three datasets, 1) UBFC-RPPG, 2) COHFACE, 3) our dataset, with the mean absolute error (MAE) of 2.15, 5.57, 1.75 beats per minute (bpm) respectively.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes