CV · May 24, 2021

Dynamic Class Queue for Large Scale Face Recognition In the Wild

arXiv:2105.11113v1 · 32 citations · Has Code
Originality: Incremental advance
AI Analysis

This work addresses efficiency and class imbalance issues in face recognition for real-world applications, representing an incremental improvement over existing methods.

The paper tackles computing resource constraints and long-tailed class distribution in large-scale face recognition by proposing a dynamic class queue (DCQ). The authors report performance similar to training with all classes while using only 10% of them, and a clear improvement over a strong baseline on the Megaface Challenge2 (MF2) dataset with 672K identities.

Learning discriminative representations from large-scale face datasets in the wild is crucial for real-world applications, yet it remains challenging. The difficulties lie in many aspects, and this work focuses on two of them: computing resource constraints and long-tailed class distribution. Recently, classification-based representation learning with deep neural networks and well-designed losses has demonstrated good recognition performance. However, the computing and memory costs scale linearly with the number of identities (classes) in the training set, and the learning process suffers from unbalanced classes. In this work, we propose a dynamic class queue (DCQ) to tackle these two problems. Specifically, at each training iteration, a subset of classes for recognition is dynamically selected, and their class weights are generated on the fly and stored in a queue. Since only a subset of classes is selected at each iteration, the computing requirement is reduced. Using a single server without model parallelism, we empirically verify on large-scale datasets that 10% of the classes are sufficient to achieve performance similar to using all classes. Moreover, the class weights are dynamically generated in a few-shot manner and are therefore suitable for tail classes with only a few instances. We show clear improvement over a strong baseline on the largest public dataset, Megaface Challenge2 (MF2), which has 672K identities, over 88% of which have fewer than 10 instances. Code is available at https://github.com/bilylee/DCQ
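
The queue mechanism described in the abstract can be illustrated with a small sketch. This is not the authors' implementation (see the repository linked above for that); it is a toy under stated assumptions: the names `DynamicClassQueue`, `queue_softmax_loss`, and `sample`, the scale value 16.0, and the synthetic "embeddings" are all hypothetical, and no network is trained. It only shows the two ideas from the abstract: a class weight is generated on the fly from a single instance of that class, and the classification loss is computed over the classes currently in the queue rather than over all identities.

```python
import math
import random
from collections import deque


def l2_normalize(v):
    """Scale a vector to unit length (a zero vector is returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]


class DynamicClassQueue:
    """Hypothetical FIFO queue of (class_id, weight) pairs for a class subset."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.items = deque(maxlen=capacity)

    def enqueue(self, class_id, weight):
        # Drop any stale weight for this class so it appears at most once,
        # then append the fresh one; the oldest entries fall off the front.
        kept = [(c, w) for c, w in self.items if c != class_id]
        kept.append((class_id, l2_normalize(weight)))
        self.items = deque(kept, maxlen=self.capacity)


def queue_softmax_loss(feature, target_class, queue, scale=16.0):
    """Softmax cross-entropy over the queued classes only.

    Cost is proportional to the queue size, not to the total class count.
    The scale of 16.0 is an arbitrary assumption for this sketch.
    """
    feature = l2_normalize(feature)
    logits, target_idx = [], None
    for i, (cid, w) in enumerate(queue.items):
        logits.append(scale * sum(a * b for a, b in zip(feature, w)))
        if cid == target_class:
            target_idx = i
    if target_idx is None:
        raise ValueError("target class is not in the queue")
    m = max(logits)  # subtract the max for numerical stability
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[target_idx]


if __name__ == "__main__":
    random.seed(0)
    num_classes, dim, capacity = 1000, 8, 100  # queue holds 10% of classes
    centers = [[random.gauss(0, 1) for _ in range(dim)]
               for _ in range(num_classes)]

    def sample(cid):
        # Toy stand-in for the embedding of one image of identity `cid`.
        return [x + random.gauss(0, 0.1) for x in centers[cid]]

    queue = DynamicClassQueue(capacity)
    for step in range(50):
        batch = random.sample(range(num_classes), 16)
        for cid in batch:
            # Few-shot weight generation: one instance defines the weight.
            queue.enqueue(cid, sample(cid))
        losses = [queue_softmax_loss(sample(cid), cid, queue) for cid in batch]
    print(len(queue.items), sum(losses) / len(losses))
```

In this sketch a tail class with only two images is treated the same as a head class: one image produces the class weight and the other is the query. That is the sense in which on-the-fly weight generation suits classes with few instances. How the paper selects classes, generates weights, and updates the networks is not reproduced here.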

Code Implementations: 1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.
