CVJul 23, 2019

Multimodal Age and Gender Classification Using Ear and Profile Face Images

arXiv:1907.10081v134 citations
Originality Incremental advance
AI Analysis

This work addresses soft biometric trait extraction for applications like surveillance or identification, but it is incremental as it builds on existing multimodal and deep learning approaches.

The paper tackled age and gender classification by combining ear and profile face images using multimodal deep learning, achieving very high accuracies and superior results compared to state-of-the-art methods.

In this paper, we present multimodal deep neural network frameworks for age and gender classification, which take input a profile face image as well as an ear image. Our main objective is to enhance the accuracy of soft biometric trait extraction from profile face images by additionally utilizing a promising biometric modality: ear appearance. For this purpose, we provided end-to-end multimodal deep learning frameworks. We explored different multimodal strategies by employing data, feature, and score level fusion. To increase representation and discrimination capability of the deep neural networks, we benefited from domain adaptation and employed center loss besides softmax loss. We conducted extensive experiments on the UND-F, UND-J2, and FERET datasets. Experimental results indicated that profile face images contain a rich source of information for age and gender classification. We found that the presented multimodal system achieves very high age and gender classification accuracies. Moreover, we attained superior results compared to the state-of-the-art profile face image or ear image-based age and gender classification methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes