CV · Dec 11, 2020

A Multi-task Joint Framework for Real-time Person Search

Ye Li, Kangning Yin, Jie Liang, Chunyu Wang, Guangqiang Yin

arXiv:2012.06418v1

Originality: Incremental advance

AI Analysis

This work provides an incremental improvement in real-time person search for surveillance and security applications.

This paper proposes a Multi-task Joint Framework (MJF) for real-time person search. It targets two problems: detection errors carry over into identity comparison, and real-time performance is hard to reach in practice. On 1920×1080 video with a 500-ID table, the framework reports an identification rate of 93.6% at 25.7 frames per second.

Person search generally involves three parts: person detection, feature extraction, and identity comparison. However, a pipeline that integrates all three has two drawbacks. First, the accuracy of detection affects the accuracy of comparison. Second, real-time performance is difficult to achieve in real-world applications. To solve these problems, we propose a Multi-task Joint Framework for real-time person search (MJF), which optimizes person detection, feature extraction, and identity comparison separately. For the person detection module, we propose the YOLOv5-GS model, which is trained on a person dataset. It combines the advantages of GhostNet and the Squeeze-and-Excitation (SE) block, improving both speed and accuracy. For the feature extraction module, we design the Model Adaptation Architecture (MAA), which selects a different network according to the number of people in the scene and thereby balances accuracy against speed. For identity comparison, we propose a Three Dimension (3D) Pooled Table and a matching strategy to improve identification accuracy. On 1920×1080 resolution video with a 500-ID table, the identification rate (IR) and frames per second (FPS) achieved by our method reach 93.6% and 25.7, respectively.
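The abstract does not give implementation details, so the following is only a rough sketch of two of the ideas it describes: choosing a feature extractor by the number of detected people, and matching a query feature against a table of stored identities. The thresholds, the model names, the table layout (ID × stored samples × feature dimension), the use of cosine similarity, and the max-over-samples rule are all assumptions made for illustration, not the authors' method.

```python
import math

# Hypothetical thresholds and model names; the abstract does not specify them.
EXTRACTORS = [
    (5, "large_backbone"),    # few people in frame: favour accuracy
    (15, "medium_backbone"),  # moderate crowd: compromise
]
FALLBACK = "small_backbone"   # crowded frame: favour speed


def select_extractor(num_people):
    """Pick an extractor name from the number of detected people."""
    for max_people, name in EXTRACTORS:
        if num_people <= max_people:
            return name
    return FALLBACK


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def match_identity(query, table, threshold=0.5):
    """Return (best_id, score), or (None, score) if no ID clears the threshold.

    `table` maps each ID to a list of stored feature vectors, i.e. an
    assumed ID x samples x dimension layout. Each ID is scored by its
    best-matching stored sample.
    """
    best_id, best_score = None, -1.0
    for person_id, samples in table.items():
        score = max((cosine(query, s) for s in samples), default=-1.0)
        if score > best_score:
            best_id, best_score = person_id, score
    if best_score < threshold:
        return None, best_score
    return best_id, best_score
```

For example, `select_extractor(3)` picks the largest model under these made-up thresholds, while a frame with 40 detections falls back to the smallest one. Scoring each ID by its best stored sample is one simple way to tolerate pose or lighting variation across samples; the paper's actual pooling and matching strategy may differ.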
