CVJul 25, 2022

On Mitigating Hard Clusters for Face Clustering

arXiv:2207.11895v115 citationsh-index: 26Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of misclassification in face recognition systems for applications requiring accurate clustering of heterogeneous face images, though it is incremental as it builds on existing methods.

The paper tackles the challenge of identifying small or sparse face clusters in face clustering by leveraging neighborhood information and probabilistic cluster membership inference, achieving new state-of-the-art performance on multiple benchmarks.

Face clustering is a promising way to scale up face recognition systems using large-scale unlabeled face images. It remains challenging to identify small or sparse face image clusters that we call hard clusters, which is caused by the heterogeneity, \ie, high variations in size and sparsity, of the clusters. Consequently, the conventional way of using a uniform threshold (to identify clusters) often leads to a terrible misclassification for the samples that should belong to hard clusters. We tackle this problem by leveraging the neighborhood information of samples and inferring the cluster memberships (of samples) in a probabilistic way. We introduce two novel modules, Neighborhood-Diffusion-based Density (NDDe) and Transition-Probability-based Distance (TPDi), based on which we can simply apply the standard Density Peak Clustering algorithm with a uniform threshold. Our experiments on multiple benchmarks show that each module contributes to the final performance of our method, and by incorporating them into other advanced face clustering methods, these two modules can boost the performance of these methods to a new state-of-the-art. Code is available at: https://github.com/echoanran/On-Mitigating-Hard-Clusters.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes