CVNov 30, 2024

Dynamic Token Selection for Aerial-Ground Person Re-Identification

arXiv:2412.00433v24 citationsh-index: 2
Originality Incremental advance
AI Analysis

This work addresses inefficiencies in person re-identification for aerial-ground scenarios, which is an incremental improvement for surveillance and security applications.

The paper tackles the problem of Aerial-Ground Person Re-identification by proposing a Dynamic Token Selective Transformer that dynamically selects key tokens to focus on relevant regions, achieving a 1.18% mAP improvement on the CARGO dataset compared to the second-best method.

Aerial-Ground Person Re-identification (AGPReID) holds significant practical value but faces unique challenges due to pronounced variations in viewing angles, lighting conditions, and background interference. Traditional methods, often involving a global analysis of the entire image, frequently lead to inefficiencies and susceptibility to irrelevant data. In this paper, we propose a novel Dynamic Token Selective Transformer (DTST) tailored for AGPReID, which dynamically selects pivotal tokens to concentrate on pertinent regions. Specifically, we segment the input image into multiple tokens, with each token representing a unique region or feature within the image. Using a Top-k strategy, we extract the k most significant tokens that contain vital information essential for identity recognition. Subsequently, an attention mechanism is employed to discern interrelations among diverse tokens, thereby enhancing the representation of identity features. Extensive experiments on benchmark datasets showcases the superiority of our method over existing works. Notably, on the CARGO dataset, our proposed method gains 1.18% mAP improvements when compared to the second place. In addition, we comprehensively analyze the impact of different numbers of tokens, token insertion positions, and numbers of heads on model performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes