CVOct 9, 2021

Google Landmark Retrieval 2021 Competition Third Place Solution

arXiv:2110.04619v1
Originality Synthesis-oriented
AI Analysis

This is an incremental improvement for landmark retrieval and recognition competitions, focusing on enhancing performance in a specific domain.

The authors tackled the Google Landmark Challenges 2021 by using ensembles of transformers and ConvNet models with Sub-center ArcFace and dynamic margins, achieving third place in retrieval and fourth in recognition.

We present our solutions to the Google Landmark Challenges 2021, for both the retrieval and the recognition tracks. Both solutions are ensembles of transformers and ConvNet models based on Sub-center ArcFace with dynamic margins. Since the two tracks share the same training data, we used the same pipeline and training approach, but with different model selections for the ensemble and different post-processing. The key improvement over last year is newer state-of-the-art vision architectures, especially transformers which significantly outperform ConvNets for the retrieval task. We finished third and fourth places for the retrieval and recognition tracks respectively.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes