Deep Perceptual Mapping for Cross-Modal Face Recognition
This addresses a critical need for night-time surveillance and security by reducing the modality gap in thermal-to-visible face matching, though it is incremental as it builds on existing deep learning methods.
The paper tackles the challenging problem of cross-modal face recognition between thermal and visible spectra, achieving significant performance improvements, including a 10% increase on the UND-X1 dataset and 15-30% on the NVESD dataset in Rank-1 identification.
Cross modal face matching between the thermal and visible spectrum is a much desired capability for night-time surveillance and security applications. Due to a very large modality gap, thermal-to-visible face recognition is one of the most challenging face matching problem. In this paper, we present an approach to bridge this modality gap by a significant margin. Our approach captures the highly non-linear relationship between the two modalities by using a deep neural network. Our model attempts to learn a non-linear mapping from visible to thermal spectrum while preserving the identity information. We show substantive performance improvement on three difficult thermal-visible face datasets. The presented approach improves the state-of-the-art by more than 10\% on UND-X1 dataset and by more than 15-30\% on NVESD dataset in terms of Rank-1 identification. Our method bridges the drop in performance due to the modality gap by more than 40\%.