CVFeb 11, 2019

Registration-free Face-SSD: Single shot analysis of smiles, facial attributes, and affect in the wild

Youngkyoon Jang, Hatice Gunes, Ioannis Patras

arXiv:1902.04042v14.750 citationsh-index: 45

Originality Highly original

AI Analysis

This addresses the inefficiency and complexity in computer vision for real-time, multi-task face analysis in the wild, offering a unified solution that is incremental in integrating tasks but novel in its registration-free approach.

The paper tackled the problem of performing multiple face-related tasks like smile detection, attribute prediction, and affect estimation in unconstrained environments by introducing Face-SSD, a single-shot method that eliminates the need for separate pre-processing steps, achieving state-of-the-art results including 95.76% accuracy for smile detection and real-time performance at 21 FPS.

In this paper, we present a novel single shot face-related task analysis method, called Face-SSD, for detecting faces and for performing various face-related (classification/regression) tasks including smile recognition, face attribute prediction and valence-arousal estimation in the wild. Face-SSD uses a Fully Convolutional Neural Network (FCNN) to detect multiple faces of different sizes and recognise/regress one or more face-related classes. Face-SSD has two parallel branches that share the same low-level filters, one branch dealing with face detection and the other one with face analysis tasks. The outputs of both branches are spatially aligned heatmaps that are produced in parallel - therefore Face-SSD does not require that face detection, facial region extraction, size normalisation, and facial region processing are performed in subsequent steps. Our contributions are threefold: 1) Face-SSD is the first network to perform face analysis without relying on pre-processing such as face detection and registration in advance - Face-SSD is a simple and a single FCNN architecture simultaneously performing face detection and face-related task analysis - those are conventionally treated as separate consecutive tasks; 2) Face-SSD is a generalised architecture that is applicable for various face analysis tasks without modifying the network structure - this is in contrast to designing task-specific architectures; and 3) Face-SSD achieves real-time performance (21 FPS) even when detecting multiple faces and recognising multiple classes in a given image. Experimental results show that Face-SSD achieves state-of-the-art performance in various face analysis tasks by reaching a recognition accuracy of 95.76% for smile detection, 90.29% for attribute prediction, and Root Mean Square (RMS) error of 0.44 and 0.39 for valence and arousal estimation.

View on arXiv PDF

Similar