CVMar 6, 2023

Fighting noise and imbalance in Action Unit detection problems

arXiv:2303.02994v11 citationsh-index: 21
Originality Incremental advance
AI Analysis

This work addresses challenges in facial expression analysis for affective computing, but it is incremental as it builds on existing label smoothing techniques.

The paper tackled the problem of noise and class imbalance in Action Unit (AU) detection by proposing Robin Hood Label Smoothing (RHLS), which restrains label smoothing to the majority class, resulting in improved performance and outperforming state-of-the-art methods on the DISFA dataset.

Action Unit (AU) detection aims at automatically caracterizing facial expressions with the muscular activations they involve. Its main interest is to provide a low-level face representation that can be used to assist higher level affective computing tasks learning. Yet, it is a challenging task. Indeed, the available databases display limited face variability and are imbalanced toward neutral expressions. Furthermore, as AU involve subtle face movements they are difficult to annotate so that some of the few provided datapoints may be mislabeled. In this work, we aim at exploiting label smoothing ability to mitigate noisy examples impact by reducing confidence [1]. However, applying label smoothing as it is may aggravate imbalance-based pre-existing under-confidence issue and degrade performance. To circumvent this issue, we propose Robin Hood Label Smoothing (RHLS). RHLS principle is to restrain label smoothing confidence reduction to the majority class. In that extent, it alleviates both the imbalance-based over-confidence issue and the negative impact of noisy majority class examples. From an experimental standpoint, we show that RHLS provides a free performance improvement in AU detection. In particular, by applying it on top of a modern multi-task baseline we get promising results on BP4D and outperform state-of-the-art methods on DISFA.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes