End-to-End Domain Adaptive Attention Network for Cross-Domain Person Re-Identification
This addresses the problem of model generalization in person re-identification for real-world applications, representing an incremental advance by integrating attention into existing generative adversarial frameworks.
The paper tackles the challenge of person re-identification across domains by proposing an end-to-end network that jointly translates images and learns discriminative features, resulting in significant performance improvements over state-of-the-art methods on benchmark datasets.
Person re-identification (re-ID) remains challenging in a real-world scenario, as it requires a trained network to generalise to totally unseen target data in the presence of variations across domains. Recently, generative adversarial models have been widely adopted to enhance the diversity of training data. These approaches, however, often fail to generalise to other domains, as existing generative person re-identification models have a disconnect between the generative component and the discriminative feature learning stage. To address the on-going challenges regarding model generalisation, we propose an end-to-end domain adaptive attention network to jointly translate images between domains and learn discriminative re-id features in a single framework. To address the domain gap challenge, we introduce an attention module for image translation from source to target domains without affecting the identity of a person. More specifically, attention is directed to the background instead of the entire image of the person, ensuring identifying characteristics of the subject are preserved. The proposed joint learning network results in a significant performance improvement over state-of-the-art methods on several benchmark datasets.