CVIVJun 16, 2025

Overcoming Occlusions in the Wild: A Multi-Task Age Head Approach to Age Estimation

arXiv:2506.13445v12 citationsh-index: 27Pattern Recognition
Originality Incremental advance
AI Analysis

This work addresses the problem of accurate age estimation under occlusion for applications like surveillance or biometrics, representing an incremental improvement with specific gains.

The paper tackles robust facial age estimation from occluded faces in unconstrained scenarios by integrating GANs and transformers, achieving MAEs of 3.00, 4.54, and 2.53 years on FG-NET, UTKFace, and MORPH datasets, surpassing prior state-of-the-art methods.

Facial age estimation has achieved considerable success under controlled conditions. However, in unconstrained real-world scenarios, which are often referred to as 'in the wild', age estimation remains challenging, especially when faces are partially occluded, which may obscure their visibility. To address this limitation, we propose a new approach integrating generative adversarial networks (GANs) and transformer architectures to enable robust age estimation from occluded faces. We employ an SN-Patch GAN to effectively remove occlusions, while an Attentive Residual Convolution Module (ARCM), paired with a Swin Transformer, enhances feature representation. Additionally, we introduce a Multi-Task Age Head (MTAH) that combines regression and distribution learning, further improving age estimation under occlusion. Experimental results on the FG-NET, UTKFace, and MORPH datasets demonstrate that our proposed approach surpasses existing state-of-the-art techniques for occluded facial age estimation by achieving an MAE of $3.00$, $4.54$, and $2.53$ years, respectively.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes