CVMar 8, 2024

SIRST-5K: Exploring Massive Negatives Synthesis with Self-supervised Learning for Robust Infrared Small Target Detection

arXiv:2403.05416v135 citationsh-index: 20IEEE Trans Geosci Remote Sens
Originality Incremental advance
AI Analysis

This work addresses the challenge of data scarcity for researchers and practitioners in infrared small target detection, offering an incremental improvement through dataset augmentation.

The paper tackles the problem of limited training samples for infrared small target detection by proposing a negative sample augmentation method that generates a synthetic dataset (SIRST-5K) with massive pseudo-data, resulting in significant improvements in model performance and convergence speed, with outstanding results in probability of detection, false-alarm rate, and intersection over union compared to state-of-the-art methods.

Single-frame infrared small target (SIRST) detection aims to recognize small targets from clutter backgrounds. Recently, convolutional neural networks have achieved significant advantages in general object detection. With the development of Transformer, the scale of SIRST models is constantly increasing. Due to the limited training samples, performance has not been improved accordingly. The quality, quantity, and diversity of the infrared dataset are critical to the detection of small targets. To highlight this issue, we propose a negative sample augmentation method in this paper. Specifically, a negative augmentation approach is proposed to generate massive negatives for self-supervised learning. Firstly, we perform a sequential noise modeling technology to generate realistic infrared data. Secondly, we fuse the extracted noise with the original data to facilitate diversity and fidelity in the generated data. Lastly, we proposed a negative augmentation strategy to enrich diversity as well as maintain semantic invariance. The proposed algorithm produces a synthetic SIRST-5K dataset, which contains massive pseudo-data and corresponding labels. With a rich diversity of infrared small target data, our algorithm significantly improves the model performance and convergence speed. Compared with other state-of-the-art (SOTA) methods, our method achieves outstanding performance in terms of probability of detection (Pd), false-alarm rate (Fa), and intersection over union (IoU).

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes