CVJul 6, 2023

Active Learning with Contrastive Pre-training for Facial Expression Recognition

arXiv:2307.02744v11.59 citationsh-index: 7Has Code

Originality Incremental advance

AI Analysis

This addresses the challenge of high labeling costs for facial expression recognition, offering a practical improvement but is incremental as it builds on existing active learning and pre-training techniques.

The paper tackles the problem of reducing labeled data needs in facial expression recognition by studying active learning methods and finding they suffer from a 'Cold Start' issue, then proposes a contrastive self-supervised pre-training step that improves performance by up to 9.2% over random sampling and 6.7% over existing baselines.

Deep learning has played a significant role in the success of facial expression recognition (FER), thanks to large models and vast amounts of labelled data. However, obtaining labelled data requires a tremendous amount of human effort, time, and financial resources. Even though some prior works have focused on reducing the need for large amounts of labelled data using different unsupervised methods, another promising approach called active learning is barely explored in the context of FER. This approach involves selecting and labelling the most representative samples from an unlabelled set to make the best use of a limited 'labelling budget'. In this paper, we implement and study 8 recent active learning methods on three public FER datasets, FER13, RAF-DB, and KDEF. Our findings show that existing active learning methods do not perform well in the context of FER, likely suffering from a phenomenon called 'Cold Start', which occurs when the initial set of labelled samples is not well representative of the entire dataset. To address this issue, we propose contrastive self-supervised pre-training, which first learns the underlying representations based on the entire unlabelled dataset. We then follow this with the active learning methods and observe that our 2-step approach shows up to 9.2% improvement over random sampling and up to 6.7% improvement over the best existing active learning baseline without the pre-training. We will make the code for this study public upon publication at: github.com/ShuvenduRoy/ActiveFER.

View on arXiv PDF Code

Similar