TY-RIST: Tactical YOLO Tricks for Real-time Infrared Small Target Detection
This addresses challenges in defense and surveillance by enhancing real-time detection of small infrared targets, though it appears incremental as it builds on YOLO with specific optimizations.
The paper tackled the problem of infrared small target detection by proposing TY-RIST, an optimized YOLOv12n architecture that improved mAP by +7.9%, Precision by +3%, and Recall by +10.2% while reducing computational cost by about 25.5% and achieving up to 123 FPS.
Infrared small target detection (IRSTD) is critical for defense and surveillance but remains challenging due to (1) target loss from minimal features, (2) false alarms in cluttered environments, (3) missed detections from low saliency, and (4) high computational costs. To address these issues, we propose TY-RIST, an optimized YOLOv12n architecture that integrates (1) a stride-aware backbone with fine-grained receptive fields, (2) a high-resolution detection head, (3) cascaded coordinate attention blocks, and (4) a branch pruning strategy that reduces computational cost by about 25.5% while marginally improving accuracy and enabling real-time inference. We also incorporate the Normalized Gaussian Wasserstein Distance (NWD) to enhance regression stability. Extensive experiments on four benchmarks and across 20 different models demonstrate state-of-the-art performance, improving mAP at 0.5 IoU by +7.9%, Precision by +3%, and Recall by +10.2%, while achieving up to 123 FPS on a single GPU. Cross-dataset validation on a fifth dataset further confirms strong generalization capability. Additional results and resources are available at https://www.github.com/moured/TY-RIST