CVFeb 17, 2025

WRT-SAM: Foundation Model-Driven Segmentation for Generalized Weld Radiographic Testing

arXiv:2502.11338v11 citationsh-index: 1
Originality Incremental advance
AI Analysis

This work addresses the need for robust weld defect segmentation in industrial non-destructive testing, offering improved generalization across diverse scenarios, though it is incremental as it adapts an existing foundation model.

The paper tackles the problem of poor cross-scenario generalization in weld defect segmentation from radiographic images by proposing WRT-SAM, a model based on the Segment Anything Model (SAM) with adapter-based integration and prompt generators, achieving a recall of 78.87%, precision of 84.04%, and AUC of 0.9746, setting a new state-of-the-art benchmark.

Radiographic testing is a fundamental non-destructive evaluation technique for identifying weld defects and assessing quality in industrial applications due to its high-resolution imaging capabilities. Over the past decade, deep learning techniques have significantly advanced weld defect identification in radiographic images. However, conventional approaches, which rely on training small-scale, task-specific models on single-scenario datasets, exhibit poor cross-scenario generalization. Recently, the Segment Anything Model (SAM), a pre-trained visual foundation model trained on large-scale datasets, has demonstrated exceptional zero-shot generalization capabilities. Fine-tuning SAM with limited domain-specific data has yielded promising results in fields such as medical image segmentation and anomaly detection. To the best of our knowledge, this work is the first to introduce SAM-based segmentation for general weld radiographic testing images. We propose WRT-SAM, a novel weld radiographic defect segmentation model that leverages SAM through an adapter-based integration with a specialized prompt generator architecture. To improve adaptability to grayscale weld radiographic images, we introduce a frequency prompt generator module, which enhances the model's sensitivity to frequency-domain information. Furthermore, to address the multi-scale nature of weld defects, we incorporate a multi-scale prompt generator module, enabling the model to effectively extract and encode defect information across varying scales. Extensive experimental evaluations demonstrate that WRT-SAM achieves a recall of 78.87%, a precision of 84.04%, and an AUC of 0.9746, setting a new state-of-the-art (SOTA) benchmark. Moreover, the model exhibits superior zero-shot generalization performance, highlighting its potential for practical deployment in diverse radiographic testing scenarios.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes