CVAILGAug 30, 2024

One-Frame Calibration with Siamese Network in Facial Action Unit Recognition

arXiv:2409.00240v11 citationsh-index: 1
Originality Incremental advance
AI Analysis

This addresses the challenge of cross-participant generalization in facial expression analysis, offering a practical solution for more accurate AU recognition in applications like emotion detection, though it is incremental by building on existing Siamese network and backbone methods.

The paper tackles the problem of facial action unit recognition by proposing a one-frame calibration method using a single neutral expression image as a reference, which substantially improves performance on datasets like DISFA, DISFA+, and UNBC-McMaster by mitigating facial attribute biases.

Automatic facial action unit (AU) recognition is used widely in facial expression analysis. Most existing AU recognition systems aim for cross-participant non-calibrated generalization (NCG) to unseen faces without further calibration. However, due to the diversity of facial attributes across different identities, accurately inferring AU activation from single images of an unseen face is sometimes infeasible, even for human experts -- it is crucial to first understand how the face appears in its neutral expression, or significant bias may be incurred. Therefore, we propose to perform one-frame calibration (OFC) in AU recognition: for each face, a single image of its neutral expression is used as the reference image for calibration. With this strategy, we develop a Calibrating Siamese Network (CSN) for AU recognition and demonstrate its remarkable effectiveness with a simple iResNet-50 (IR50) backbone. On the DISFA, DISFA+, and UNBC-McMaster datasets, we show that our OFC CSN-IR50 model (a) substantially improves the performance of IR50 by mitigating facial attribute biases (including biases due to wrinkles, eyebrow positions, facial hair, etc.), (b) substantially outperforms the naive OFC method of baseline subtraction as well as (c) a fine-tuned version of this naive OFC method, and (d) also outperforms state-of-the-art NCG models for both AU intensity estimation and AU detection.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes