CVJun 1, 2021

Dense Nested Attention Network for Infrared Small Target Detection

arXiv:2106.00487v3821 citations
Originality Incremental advance
AI Analysis

This work addresses a domain-specific challenge in infrared small target detection for applications like surveillance or defense, with incremental improvements over existing deep learning approaches.

The paper tackles the problem of detecting small targets in single-frame infrared images, where existing CNN methods lose targets due to pooling layers, and proposes a dense nested attention network (DNANet) that achieves better performance in terms of probability of detection, false-alarm rate, and intersection of union compared to state-of-the-art methods.

Single-frame infrared small target (SIRST) detection aims at separating small targets from clutter backgrounds. With the advances of deep learning, CNN-based methods have yielded promising results in generic object detection due to their powerful modeling capability. However, existing CNN-based methods cannot be directly applied for infrared small targets since pooling layers in their networks could lead to the loss of targets in deep layers. To handle this problem, we propose a dense nested attention network (DNANet) in this paper. Specifically, we design a dense nested interactive module (DNIM) to achieve progressive interaction among high-level and low-level features. With the repeated interaction in DNIM, infrared small targets in deep layers can be maintained. Based on DNIM, we further propose a cascaded channel and spatial attention module (CSAM) to adaptively enhance multi-level features. With our DNANet, contextual information of small targets can be well incorporated and fully exploited by repeated fusion and enhancement. Moreover, we develop an infrared small target dataset (namely, NUDT-SIRST) and propose a set of evaluation metrics to conduct comprehensive performance evaluation. Experiments on both public and our self-developed datasets demonstrate the effectiveness of our method. Compared to other state-of-the-art methods, our method achieves better performance in terms of probability of detection (Pd), false-alarm rate (Fa), and intersection of union (IoU).

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes