CVOct 22, 2025

Towards Single-Source Domain Generalized Object Detection via Causal Visual Prompts

arXiv:2510.19487v1h-index: 13
Originality Incremental advance
AI Analysis

This addresses domain generalization in object detection for computer vision applications, but is incremental as it builds on existing prompt-based and causal methods.

The paper tackles the problem of single-source domain generalized object detection, where models trained on one domain fail to generalize to unseen domains due to spurious correlations like color. The proposed Cauvis method achieves state-of-the-art performance with 15.9-31.4% gains over existing methods on SDGOD datasets.

Single-source Domain Generalized Object Detection (SDGOD), as a cutting-edge research topic in computer vision, aims to enhance model generalization capability in unseen target domains through single-source domain training. Current mainstream approaches attempt to mitigate domain discrepancies via data augmentation techniques. However, due to domain shift and limited domain-specific knowledge, models tend to fall into the pitfall of spurious correlations. This manifests as the model's over-reliance on simplistic classification features (e.g., color) rather than essential domain-invariant representations like object contours. To address this critical challenge, we propose the Cauvis (Causal Visual Prompts) method. First, we introduce a Cross-Attention Prompts module that mitigates bias from spurious features by integrating visual prompts with cross-attention. To address the inadequate domain knowledge coverage and spurious feature entanglement in visual prompts for single-domain generalization, we propose a dual-branch adapter that disentangles causal-spurious features while achieving domain adaptation via high-frequency feature extraction. Cauvis achieves state-of-the-art performance with 15.9-31.4% gains over existing domain generalization methods on SDGOD datasets, while exhibiting significant robustness advantages in complex interference environments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes