CVApr 7, 2019

Unsupervised Domain Adaptation for Multispectral Pedestrian Detection

arXiv:1904.03692v119 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of deploying robust pedestrian detection systems for applications like autonomous driving and surveillance without costly manual labeling across different multispectral datasets.

The paper tackles the problem of training multispectral pedestrian detectors without manual annotations across different datasets by proposing an unsupervised domain adaptation framework that iteratively generates pseudo annotations and updates detector parameters. Experimental results show the method achieves significantly higher detection performance than approaches without domain adaptation and is competitive with supervised detectors.

Multimodal information (e.g., visible and thermal) can generate robust pedestrian detections to facilitate around-the-clock computer vision applications, such as autonomous driving and video surveillance. However, it still remains a crucial challenge to train a reliable detector working well in different multispectral pedestrian datasets without manual annotations. In this paper, we propose a novel unsupervised domain adaptation framework for multispectral pedestrian detection, by iteratively generating pseudo annotations and updating the parameters of our designed multispectral pedestrian detector on target domain. Pseudo annotations are generated using the detector trained on source domain, and then updated by fixing the parameters of detector and minimizing the cross entropy loss without back-propagation. Training labels are generated using the pseudo annotations by considering the characteristics of similarity and complementarity between well-aligned visible and infrared image pairs. The parameters of detector are updated using the generated labels by minimizing our defined multi-detection loss function with back-propagation. The optimal parameters of detector can be obtained after iteratively updating the pseudo annotations and parameters. Experimental results show that our proposed unsupervised multimodal domain adaptation method achieves significantly higher detection performance than the approach without domain adaptation, and is competitive with the supervised multispectral pedestrian detectors.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes