CVNov 24, 2022

Cross-domain Transfer of defect features in technical domains based on partial target data

arXiv:2211.13662v31.42 citationsh-index: 6

Originality Incremental advance

AI Analysis

This addresses data scarcity in defect classification for technical applications, but it is incremental as it builds on existing contrastive learning and transfer learning methods.

The paper tackles the problem of insufficient training data for defect classes in technical domains by proposing a CNN encoder with a modified triplet loss that leverages non-defective target data and a related source dataset. It shows improved domain generalization and classification results, allowing for larger domain shifts.

A common challenge in real world classification scenarios with sequentially appending target domain data is insufficient training datasets during the training phase. Therefore, conventional deep learning and transfer learning classifiers are not applicable especially when individual classes are not represented or are severely underrepresented at the outset. In many technical domains, however, it is only the defect or worn reject classes that are insufficiently represented, while the non-defect class is often available from the beginning. The proposed classification approach addresses such conditions and is based on a CNN encoder. Following a contrastive learning approach, it is trained with a modified triplet loss function using two datasets: Besides the non-defective target domain class 1st dataset, a state-of-the-art labeled source domain dataset that contains highly related classes e.g., a related manufacturing error or wear defect but originates from a highly different domain e.g., different product, material, or appearance = 2nd dataset is utilized. The approach learns the classification features from the source domain dataset while at the same time learning the differences between the source and the target domain in a single training step, aiming to transfer the relevant features to the target domain. The classifier becomes sensitive to the classification features and by architecture robust against the highly domain-specific context. The approach is benchmarked in a technical and a non-technical domain and shows convincing classification results. In particular, it is shown that the domain generalization capabilities and classification results are improved by the proposed architecture, allowing for larger domain shifts between source and target domains.

View on arXiv PDF

Similar