CVMMApr 26, 2025

CAMeL: Cross-modality Adaptive Meta-Learning for Text-based Person Retrieval

arXiv:2504.18782v114 citationsh-index: 5Has CodeIEEE Trans Inf Forensics Secur
Originality Incremental advance
AI Analysis

This addresses scalability issues in person retrieval for surveillance and security applications, though it appears incremental as it builds on existing pretraining and meta-learning approaches.

The paper tackles the problem of domain bias in text-based person retrieval by introducing a domain-agnostic pretraining framework called CAMeL, which improves model generalization and achieves state-of-the-art results on benchmarks like CUHK-PEDES, ICFG-PEDES, and RSTPReid.

Text-based person retrieval aims to identify specific individuals within an image database using textual descriptions. Due to the high cost of annotation and privacy protection, researchers resort to synthesized data for the paradigm of pretraining and fine-tuning. However, these generated data often exhibit domain biases in both images and textual annotations, which largely compromise the scalability of the pre-trained model. Therefore, we introduce a domain-agnostic pretraining framework based on Cross-modality Adaptive Meta-Learning (CAMeL) to enhance the model generalization capability during pretraining to facilitate the subsequent downstream tasks. In particular, we develop a series of tasks that reflect the diversity and complexity of real-world scenarios, and introduce a dynamic error sample memory unit to memorize the history for errors encountered within multiple tasks. To further ensure multi-task adaptation, we also adopt an adaptive dual-speed update strategy, balancing fast adaptation to new tasks and slow weight updates for historical tasks. Albeit simple, our proposed model not only surpasses existing state-of-the-art methods on real-world benchmarks, including CUHK-PEDES, ICFG-PEDES, and RSTPReid, but also showcases robustness and scalability in handling biased synthetic images and noisy text annotations. Our code is available at https://github.com/Jahawn-Wen/CAMeL-reID.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes