CVMay 15, 2024

SARATR-X: Toward Building A Foundation Model for SAR Target Recognition

arXiv:2405.09365v570 citationsh-index: 8Has CodeIEEE Transactions on Image Processing
Originality Highly original
AI Analysis

This work addresses the need for more efficient and scalable target recognition in SAR imagery, which is crucial for applications like surveillance and remote sensing, by proposing a foundational approach that reduces reliance on expensive labeled data.

The paper tackles the problem of limited generalization and scalability in synthetic aperture radar automatic target recognition (SAR ATR) by introducing SARATR-X, a foundation model trained on 0.18 million unlabeled SAR samples, which achieves competitive or superior performance in classification and detection tasks compared to prior supervised and self-supervised methods.

Despite the remarkable progress in synthetic aperture radar automatic target recognition (SAR ATR), recent efforts have concentrated on detecting and classifying a specific category, e.g., vehicles, ships, airplanes, or buildings. One of the fundamental limitations of the top-performing SAR ATR methods is that the learning paradigm is supervised, task-specific, limited-category, closed-world learning, which depends on massive amounts of accurately annotated samples that are expensively labeled by expert SAR analysts and have limited generalization capability and scalability. In this work, we make the first attempt towards building a foundation model for SAR ATR, termed SARATR-X. SARATR-X learns generalizable representations via self-supervised learning (SSL) and provides a cornerstone for label-efficient model adaptation to generic SAR target detection and classification tasks. Specifically, SARATR-X is trained on 0.18 M unlabelled SAR target samples, which are curated by combining contemporary benchmarks and constitute the largest publicly available dataset till now. Considering the characteristics of SAR images, a backbone tailored for SAR ATR is carefully designed, and a two-step SSL method endowed with multi-scale gradient features was applied to ensure the feature diversity and model scalability of SARATR-X. The capabilities of SARATR-X are evaluated on classification under few-shot and robustness settings and detection across various categories and scenes, and impressive performance is achieved, often competitive with or even superior to prior fully supervised, semi-supervised, or self-supervised algorithms. Our SARATR-X and the curated dataset are released at https://github.com/waterdisappear/SARATR-X to foster research into foundation models for SAR image interpretation.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes