CVLGIVDec 26, 2024

Extended Cross-Modality United Learning for Unsupervised Visible-Infrared Person Re-identification

arXiv:2412.19134v1h-index: 4
Originality Incremental advance
AI Analysis

This addresses the challenge of reducing inter-modality gaps in cross-modality person re-identification for security and surveillance applications, representing an incremental improvement over existing methods.

The paper tackles the problem of unsupervised visible-infrared person re-identification by proposing the ECUL framework to learn modality-invariant features, achieving promising performance that outperforms certain supervised methods on SYSU-MM01 and RegDB datasets.

Unsupervised learning visible-infrared person re-identification (USL-VI-ReID) aims to learn modality-invariant features from unlabeled cross-modality datasets and reduce the inter-modality gap. However, the existing methods lack cross-modality clustering or excessively pursue cluster-level association, which makes it difficult to perform reliable modality-invariant features learning. To deal with this issue, we propose a Extended Cross-Modality United Learning (ECUL) framework, incorporating Extended Modality-Camera Clustering (EMCC) and Two-Step Memory Updating Strategy (TSMem) modules. Specifically, we design ECUL to naturally integrates intra-modality clustering, inter-modality clustering and inter-modality instance selection, establishing compact and accurate cross-modality associations while reducing the introduction of noisy labels. Moreover, EMCC captures and filters the neighborhood relationships by extending the encoding vector, which further promotes the learning of modality-invariant and camera-invariant knowledge in terms of clustering algorithm. Finally, TSMem provides accurate and generalized proxy points for contrastive learning by updating the memory in stages. Extensive experiments results on SYSU-MM01 and RegDB datasets demonstrate that the proposed ECUL shows promising performance and even outperforms certain supervised methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes