GRCVHCMMApr 23

StyleID: A Perception-Aware Dataset and Metric for Stylization-Agnostic Facial Identity Recognition

arXiv:2604.2168951.4
AI Analysis

For researchers and practitioners in face recognition and creative stylization, this work provides a benchmark and calibration method to address the brittleness of identity encoders under stylization, though it is domain-specific to facial identity.

The paper introduces StyleID, a perception-aware dataset and metric for evaluating facial identity recognition under stylization, and shows that fine-tuning encoders with this data improves correlation with human judgments and robustness to out-of-domain stylized portraits.

Creative face stylization aims to render portraits in diverse visual idioms such as cartoons, sketches, and paintings while retaining recognizable identity. However, current identity encoders, which are typically trained and calibrated on natural photographs, exhibit severe brittleness under stylization. They often mistake changes in texture or color palette for identity drift or fail to detect geometric exaggerations. This reveals the lack of a style-agnostic framework to evaluate and supervise identity consistency across varying styles and strengths. To address this gap, we introduce StyleID, a human perception-aware dataset and evaluation framework for facial identity under stylization. StyleID comprises two datasets: (i) StyleBench-H, a benchmark that captures human same-different verification judgments across diffusion- and flow-matching-based stylization at multiple style strengths, and (ii) StyleBench-S, a supervision set derived from psychometric recognition-strength curves obtained through controlled two-alternative forced-choice (2AFC) experiments. Leveraging StyleBench-S, we fine-tune existing semantic encoders to align their similarity orderings with human perception across styles and strengths. Experiments demonstrate that our calibrated models yield significantly higher correlation with human judgments and enhanced robustness for out-of-domain, artist drawn portraits. All of our datasets, code, and pretrained models are publicly available at https://kwanyun.github.io/StyleID_page/

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes