CVMay 7

Layer-Guided UAV Tracking: Enhancing Efficiency and Occlusion Robustness

arXiv:2602.1363660.5h-index: 15Has Code
AI Analysis

For UAV visual tracking, LGTrack addresses the accuracy-efficiency trade-off and occlusion robustness, offering a practical solution for real-time applications.

LGTrack introduces a unified UAV tracking framework with dynamic layer selection and efficient feature enhancement, achieving 258.7 FPS on UAVDT and 82.8% precision, balancing speed and accuracy under occlusion.

Visual object tracking (VOT) plays a pivotal role in unmanned aerial vehicle (UAV) applications. Addressing the trade-off between accuracy and efficiency, especially under challenging conditions like unpredictable occlusion, remains a significant challenge. This paper introduces LGTrack, a unified UAV tracking framework that integrates dynamic layer selection, efficient feature enhancement, and robust representation learning for occlusions. By employing a novel lightweight Global-Grouped Coordinate Attention (GGCA) module, LGTrack captures long-range dependencies and global contexts, enhancing feature discriminability with minimal computational overhead. Additionally, a lightweight Similarity-Guided Layer Adaptation (SGLA) module replaces knowledge distillation, achieving an optimal balance between tracking precision and inference efficiency. Experiments on three datasets demonstrate LGTrack's state-of-the-art real-time speed (258.7 FPS on UAVDT) while maintaining competitive tracking accuracy (82.8\% precision). Code is available at https://github.com/XiaoMoc/LGTrack

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes