CV · Nov 13, 2024

HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere

arXiv:2411.08470v2 · 14 citations · h-index: 12 · ICLR
Originality: Incremental advance
AI Analysis

This paper addresses privacy and ethical concerns in face recognition by proposing a method for generating synthetic datasets. The advance is incremental, since it builds on existing generative models and embedding techniques.

The paper tackles the challenge of generating synthetic face recognition datasets with sufficient inter-class variation by formulating it as a packing problem on the hypersphere of a face embedding space. Results show that models trained with the HyperFace approach achieve state-of-the-art performance among models trained on synthetic data when evaluated on real benchmarks.

Face recognition datasets are often collected by crawling the Internet without individuals' consent, raising ethical and privacy concerns. Generating synthetic datasets for training face recognition models has emerged as a promising alternative. However, generating synthetic datasets remains challenging because it requires adequate inter-class and intra-class variation. While advances in generative models have made it easier to increase intra-class variation in face datasets (such as pose and illumination), generating sufficient inter-class variation is still a difficult task. In this paper, we formulate dataset generation as a packing problem on the embedding space (represented on a hypersphere) of a face recognition model and propose a new synthetic dataset generation approach, called HyperFace. We formalize our packing problem as an optimization problem and solve it with a gradient descent-based approach. Then, we use a conditional face generator model to synthesize face images from the optimized embeddings. We use our generated datasets to train face recognition models and evaluate the trained models on several real benchmark datasets. Our experimental results show that models trained with HyperFace achieve state-of-the-art performance among face recognition models trained on synthetic datasets.
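
The packing step described in the abstract can be illustrated with a small sketch. The abstract does not state the objective that HyperFace optimizes, so the loss below is an assumption chosen for illustration: for each point, a smooth maximum (log-sum-exp) of its cosine similarity to all other points, minimized by gradient descent with re-normalization onto the unit hypersphere after every step. The function names, the step size, the temperature, and the problem size (64 points in 16 dimensions) are all hypothetical, and the conditional face generator that would turn the optimized embeddings into images is not modeled here.

```python
import numpy as np


def normalize(x):
    """Project each row of x onto the unit hypersphere."""
    return x / np.linalg.norm(x, axis=1, keepdims=True)


def min_pairwise_angle(x):
    """Smallest angle (radians) between any two distinct rows of x."""
    cos = np.clip(x @ x.T, -1.0, 1.0)
    np.fill_diagonal(cos, -1.0)  # ignore each point's similarity to itself
    return float(np.arccos(cos.max()))


def pack_on_hypersphere(init, steps=500, lr=0.1, temperature=10.0):
    """Spread points apart on the unit hypersphere with projected gradient descent.

    Assumed loss (not taken from the paper): for each point, a smooth maximum
    (log-sum-exp) of its cosine similarity to every other point.
    """
    x = normalize(np.asarray(init, dtype=float))
    for _ in range(steps):
        cos = x @ x.T
        np.fill_diagonal(cos, -np.inf)  # exclude self-pairs from the softmax
        logits = temperature * cos
        logits -= logits.max(axis=1, keepdims=True)  # numerical stability
        p = np.exp(logits)
        p /= p.sum(axis=1, keepdims=True)  # row-wise softmax over neighbours
        grad = (p + p.T) @ x  # gradient of the summed smooth-max loss
        x = normalize(x - lr * grad)  # descent step, then back onto the sphere
    return x


# Hypothetical usage: 64 random "identity" embeddings in 16 dimensions.
rng = np.random.default_rng(0)
init = normalize(rng.standard_normal((64, 16)))
packed = pack_on_hypersphere(init)
print("min angle before:", min_pairwise_angle(init))
print("min angle after: ", min_pairwise_angle(packed))
```

The smallest pairwise angle serves here as a rough proxy for inter-class separation: the larger it is, the further apart the closest two synthetic identities sit in the embedding space. A real face embedding space would be higher-dimensional, and the reference points would presumably be constrained to regions that correspond to plausible faces, which this sketch does not attempt.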

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.
