CVJul 19, 2019

VRSTC: Occlusion-Free Video Person Re-Identification

arXiv:1907.08427v1226 citations
Originality Highly original
AI Analysis

This addresses occlusion issues in surveillance video analysis, offering a novel solution for more accurate person re-identification in practical scenarios.

The paper tackles the problem of performance degradation in video person re-identification under partial occlusion by proposing STCnet to recover occluded parts using spatio-temporal information, resulting in a framework (VRSTC) that outperforms state-of-the-art methods on three challenging databases.

Video person re-identification (re-ID) plays an important role in surveillance video analysis. However, the performance of video re-ID degenerates severely under partial occlusion. In this paper, we propose a novel network, called Spatio-Temporal Completion network (STCnet), to explicitly handle partial occlusion problem. Different from most previous works that discard the occluded frames, STCnet can recover the appearance of the occluded parts. For one thing, the spatial structure of a pedestrian frame can be used to predict the occluded body parts from the unoccluded body parts of this frame. For another, the temporal patterns of pedestrian sequence provide important clues to generate the contents of occluded parts. With the Spatio-temporal information, STCnet can recover the appearance for the occluded parts, which could be leveraged with those unoccluded parts for more accurate video re-ID. By combining a re-ID network with STCnet, a video re-ID framework robust to partial occlusion (VRSTC) is proposed. Experiments on three challenging video re-ID databases demonstrate that the proposed approach outperforms the state-of-the-art.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes