CVDec 11, 2022

Mutimodal Ranking Optimization for Heterogeneous Face Re-identification

arXiv:2212.05510v2h-index: 18
Originality Incremental advance
AI Analysis

This addresses a domain discrepancy problem in video surveillance applications, but appears to be an incremental improvement over existing methods.

The paper tackles the problem of heterogeneous face re-identification across visible light and near-infrared cameras by proposing a multimodal fusion ranking optimization algorithm that uses face translation and fusion strategies, achieving improved performance on the SCface dataset.

Heterogeneous face re-identification, namely matching heterogeneous faces across disjoint visible light (VIS) and near-infrared (NIR) cameras, has become an important problem in video surveillance application. However, the large domain discrepancy between heterogeneous NIR-VIS faces makes the performance of face re-identification degraded dramatically. To solve this problem, a multimodal fusion ranking optimization algorithm for heterogeneous face re-identification is proposed in this paper. Firstly, we design a heterogeneous face translation network to obtain multimodal face pairs, including NIR-VIS/NIR-NIR/VIS-VIS face pairs, through mutual transformation between NIR-VIS faces. Secondly, we propose linear and non-linear fusion strategies to aggregate initial ranking lists of multimodal face pairs and acquire the optimized re-ranked list based on modal complementarity. The experimental results show that the proposed multimodal fusion ranking optimization algorithm can effectively utilize the complementarity and outperforms some relative methods on the SCface dataset.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes