CVLGJan 26

Facial Emotion Recognition on FER-2013 using an EfficientNetB2-Based Approach

arXiv:2601.18228v11.5
Originality Incremental advance
AI Analysis

This work addresses the problem of real-time and edge-based emotion recognition for applications requiring efficient models, though it is incremental as it adapts existing methods to a specific dataset.

The paper tackled facial emotion recognition on the challenging FER-2013 dataset by developing a lightweight EfficientNetB2-based pipeline, achieving a test accuracy of 68.78% with nearly ten times fewer parameters than VGG16 baselines.

Detection of human emotions based on facial images in real-world scenarios is a difficult task due to low image quality, variations in lighting, pose changes, background distractions, small inter-class variations, noisy crowd-sourced labels, and severe class imbalance, as observed in the FER-2013 dataset of 48x48 grayscale images. Although recent approaches using large CNNs such as VGG and ResNet achieve reasonable accuracy, they are computationally expensive and memory-intensive, limiting their practicality for real-time applications. We address these challenges using a lightweight and efficient facial emotion recognition pipeline based on EfficientNetB2, trained using a two-stage warm-up and fine-tuning strategy. The model is enhanced with AdamW optimization, decoupled weight decay, label smoothing (epsilon = 0.06) to reduce annotation noise, and clipped class weights to mitigate class imbalance, along with dropout, mixed-precision training, and extensive real-time data augmentation. The model is trained using a stratified 87.5%/12.5% train-validation split while keeping the official test set intact, achieving a test accuracy of 68.78% with nearly ten times fewer parameters than VGG16-based baselines. Experimental results, including per-class metrics and learning dynamics, demonstrate stable training and strong generalization, making the proposed approach suitable for real-time and edge-based applications.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes