NECVNov 12, 2015

Going Deeper in Facial Expression Recognition using Deep Neural Networks

arXiv:1511.04110v1955 citations
Originality Incremental advance
AI Analysis

This addresses generalizability issues in automated facial expression recognition for computer vision applications, though it appears incremental as it builds on existing deep learning approaches.

The paper tackles the problem of facial expression recognition lacking generalizability to unseen images by proposing a deep neural network architecture with two convolutional layers and four Inception layers, achieving results comparable to or better than state-of-the-art methods across seven public datasets.

Automated Facial Expression Recognition (FER) has remained a challenging and interesting problem. Despite efforts made in developing various methods for FER, existing approaches traditionally lack generalizability when applied to unseen images or those that are captured in wild setting. Most of the existing approaches are based on engineered features (e.g. HOG, LBPH, and Gabor) where the classifier's hyperparameters are tuned to give best recognition accuracies across a single database, or a small collection of similar databases. Nevertheless, the results are not significant when they are applied to novel data. This paper proposes a deep neural network architecture to address the FER problem across multiple well-known standard face datasets. Specifically, our network consists of two convolutional layers each followed by max pooling and then four Inception layers. The network is a single component architecture that takes registered facial images as the input and classifies them into either of the six basic or the neutral expressions. We conducted comprehensive experiments on seven publically available facial expression databases, viz. MultiPIE, MMI, CK+, DISFA, FERA, SFEW, and FER2013. The results of proposed architecture are comparable to or better than the state-of-the-art methods and better than traditional convolutional neural networks and in both accuracy and training time.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes