CVSep 9, 2017

How to Train Triplet Networks with 100K Identities?

arXiv:1709.02940v110.454 citations

Originality Incremental advance

AI Analysis

This addresses scalability issues in face recognition for large datasets, though it is incremental as it builds on existing online hard negative mining methods.

The paper tackles the challenge of training triplet networks with 100K identities in face recognition by proposing subspace learning to split identity spaces, making hard triplets easier to find, which improves performance on the MS-Celeb-1M challenge.

Training triplet networks with large-scale data is challenging in face recognition. Due to the number of possible triplets explodes with the number of samples, previous studies adopt the online hard negative mining(OHNM) to handle it. However, as the number of identities becomes extremely large, the training will suffer from bad local minima because effective hard triplets are difficult to be found. To solve the problem, in this paper, we propose training triplet networks with subspace learning, which splits the space of all identities into subspaces consisting of only similar identities. Combined with the batch OHNM, hard triplets can be found much easier. Experiments on the large-scale MS-Celeb-1M challenge with 100K identities demonstrate that the proposed method can largely improve the performance. In addition, to deal with heavy noise and large-scale retrieval, we also make some efforts on robust noise removing and efficient image retrieval, which are used jointly with the subspace learning to obtain the state-of-the-art performance on the MS-Celeb-1M competition (without external data in Challenge1).

View on arXiv PDF

Similar