CVAIAug 20, 2024

Generalizable Facial Expression Recognition

arXiv:2408.10614v118 citationsh-index: 27Has Code
Originality Incremental advance
AI Analysis

This addresses the challenge of zero-shot generalization in FER for real-world deployment where acquiring target domain data is infeasible, representing a domain-specific advancement.

The paper tackles the problem of facial expression recognition (FER) models failing on test sets with domain gaps by proposing a novel pipeline that extracts expression-related features using CLIP's generalizable face features and learns sigmoid masks to preserve generalization while improving precision, achieving state-of-the-art performance on five datasets with significant margins.

SOTA facial expression recognition (FER) methods fail on test sets that have domain gaps with the train set. Recent domain adaptation FER methods need to acquire labeled or unlabeled samples of target domains to fine-tune the FER model, which might be infeasible in real-world deployment. In this paper, we aim to improve the zero-shot generalization ability of FER methods on different unseen test sets using only one train set. Inspired by how humans first detect faces and then select expression features, we propose a novel FER pipeline to extract expression-related features from any given face images. Our method is based on the generalizable face features extracted by large models like CLIP. However, it is non-trivial to adapt the general features of CLIP for specific tasks like FER. To preserve the generalization ability of CLIP and the high precision of the FER model, we design a novel approach that learns sigmoid masks based on the fixed CLIP face features to extract expression features. To further improve the generalization ability on unseen test sets, we separate the channels of the learned masked features according to the expression classes to directly generate logits and avoid using the FC layer to reduce overfitting. We also introduce a channel-diverse loss to make the learned masks separated. Extensive experiments on five different FER datasets verify that our method outperforms SOTA FER methods by large margins. Code is available in https://github.com/zyh-uaiaaaa/Generalizable-FER.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes