CVMMJun 11, 2021

Shallow Optical Flow Three-Stream CNN for Macro- and Micro-Expression Spotting from Long Videos

arXiv:2106.06489v151 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of detecting subtle facial expressions in video analysis for applications like emotion recognition, though it appears incremental as it builds on existing CNN and optical flow methods.

The paper tackled the challenge of spotting micro-expressions in long videos by proposing SOFTNet, a shallow optical flow three-stream CNN model that frames spotting as a regression problem with pseudo-labeling, achieving state-of-the-art performance on the MEGC 2020 benchmark with CAS(ME)^2 and promising results on SAMM Long Videos.

Facial expressions vary from the visible to the subtle. In recent years, the analysis of micro-expressions $-$ a natural occurrence resulting from the suppression of one's true emotions, has drawn the attention of researchers with a broad range of potential applications. However, spotting microexpressions in long videos becomes increasingly challenging when intertwined with normal or macro-expressions. In this paper, we propose a shallow optical flow three-stream CNN (SOFTNet) model to predict a score that captures the likelihood of a frame being in an expression interval. By fashioning the spotting task as a regression problem, we introduce pseudo-labeling to facilitate the learning process. We demonstrate the efficacy and efficiency of the proposed approach on the recent MEGC 2020 benchmark, where state-of-the-art performance is achieved on CAS(ME)$^{2}$ with equally promising results on SAMM Long Videos.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes