CVMay 18, 2021

Non-contact Pain Recognition from Video Sequences with Remote Physiological Measurements Prediction

arXiv:2105.08822v211 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of objective and convenient pain recognition for medical applications, though it is incremental as it builds on existing multi-modal approaches.

The paper tackles non-contact pain recognition from video by proposing a multi-task learning framework that integrates appearance changes and remote physiological cues (rPPG) to overcome limitations of existing methods, achieving state-of-the-art performance on public pain databases.

Automatic pain recognition is paramount for medical diagnosis and treatment. The existing works fall into three categories: assessing facial appearance changes, exploiting physiological cues, or fusing them in a multi-modal manner. However, (1) appearance changes are easily affected by subjective factors which impedes objective pain recognition. Besides, the appearance-based approaches ignore long-range spatial-temporal dependencies that are important for modeling expressions over time; (2) the physiological cues are obtained by attaching sensors on human body, which is inconvenient and uncomfortable. In this paper, we present a novel multi-task learning framework which encodes both appearance changes and physiological cues in a non-contact manner for pain recognition. The framework is able to capture both local and long-range dependencies via the proposed attention mechanism for the learned appearance representations, which are further enriched by temporally attended physiological cues (remote photoplethysmography, rPPG) that are recovered from videos in the auxiliary task. This framework is dubbed rPPG-enriched Spatio-Temporal Attention Network (rSTAN) and allows us to establish the state-of-the-art performance of non-contact pain recognition on publicly available pain databases. It demonstrates that rPPG predictions can be used as an auxiliary task to facilitate non-contact automatic pain recognition.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes