CVNov 16, 2025

Self-Supervised Visual Prompting for Cross-Domain Road Damage Detection

arXiv:2511.12410v13 citationsHas Code
Originality Highly original
AI Analysis

This work addresses the costly re-annotation issue for road damage detection in new environments, offering a scalable solution for visual inspection systems.

The paper tackles the problem of poor cross-domain generalization in automated pavement defect detection by proposing a self-supervised framework that uses visual prompting to adapt to new environments without labels, achieving robust zero-shot transfer and improved resilience to domain variations across four benchmarks.

The deployment of automated pavement defect detection is often hindered by poor cross-domain generalization. Supervised detectors achieve strong in-domain accuracy but require costly re-annotation for new environments, while standard self-supervised methods capture generic features and remain vulnerable to domain shift. We propose \ours, a self-supervised framework that \emph{visually probes} target domains without labels. \ours introduces a Self-supervised Prompt Enhancement Module (SPEM), which derives defect-aware prompts from unlabeled target data to guide a frozen ViT backbone, and a Domain-Aware Prompt Alignment (DAPA) objective, which aligns prompt-conditioned source and target representations. Experiments on four challenging benchmarks show that \ours consistently outperforms strong supervised, self-supervised, and adaptation baselines, achieving robust zero-shot transfer, improved resilience to domain variations, and high data efficiency in few-shot adaptation. These results highlight self-supervised prompting as a practical direction for building scalable and adaptive visual inspection systems. Source code is publicly available: https://github.com/xixiaouab/PROBE/tree/main

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes