CVMar 15, 2018

Deep Structure Inference Network for Facial Action Unit Recognition

arXiv:1803.05873v2133 citations
Originality Incremental advance
AI Analysis

This work addresses facial expression analysis for applications like human-computer interaction, but it is incremental as it builds on existing methods for AU recognition.

The paper tackled the problem of facial Action Unit recognition by proposing a deep neural architecture that combines local and global features and replicates message passing between classes, resulting in state-of-the-art improvements of 5.3% on BP4D and 8.2% on DISFA datasets.

Facial expressions are combinations of basic components called Action Units (AU). Recognizing AUs is key for developing general facial expression analysis. In recent years, most efforts in automatic AU recognition have been dedicated to learning combinations of local features and to exploiting correlations between Action Units. In this paper, we propose a deep neural architecture that tackles both problems by combining learned local and global features in its initial stages and replicating a message passing algorithm between classes similar to a graphical model inference approach in later stages. We show that by training the model end-to-end with increased supervision we improve state-of-the-art by 5.3% and 8.2% performance on BP4D and DISFA datasets, respectively.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes