CVMay 19

Dual-Prompt CLIP with Hybrid Visual Encoders for Occluded Person Re-Identification

arXiv:2605.1952718.3Has Code
Predicted impact top 32% in CV · last 90 daysOriginality Incremental advance
AI Analysis

For person re-identification in surveillance, this work improves robustness to occlusion, a key practical challenge, but the gains are incremental over existing CLIP-based methods.

DPL-ReID tackles occluded person re-identification by introducing a dual prompt learning strategy and a real-world occlusion augmentation method, achieving state-of-the-art performance on benchmark datasets.

Occluded person re-identification focuses on matching partially visible pedestrians across multiple camera views. However, occlusions disrupt body-region cues, thereby complicating cross-view matching. Most person ReID methods built on pretrained vision-language models only focus on enhancing prompt-based feature learning while ignoring the semantic information of occluders. Based on the success of CLIP-ReID, we propose a novel Dual Prompt Learning ReID (DPL-ReID) model for occluded person ReID. It incorporates a Dual Prompt Learning (Dual-PL) strategy, which can utilize textual cues to capture complete pedestrian semantics and keep robustness against occlusion, and a Real-World Occlusion Augmentation (RWOA) method that realistically simulates occlusion scenarios encountered in real word to enrich occluded samples. In addition, we also design a Weighted Gated Feature Fusion (WGFF) method, which in corporates LSNet to capture global information and act as a feature-gating mechanism. This mechanism can effectively guide the CLIP visual encoder toward generating more comprehensive feature representations. Extensive experiments on several benchmark occluded ReID datasets show that our proposed DPL-ReID achieves the state-of-the art performance. The occlusion instance library are available at https://github.com/stone-qiao/DPL-ReID.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes