CVJul 16, 2018

SCAN: Self-and-Collaborative Attention Network for Video Person Re-identification

arXiv:1807.05688v484 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of matching pedestrian sequences across camera views for surveillance applications, representing an incremental improvement over existing methods.

The paper tackled video person re-identification by proposing the Self-and-Collaborative Attention Network (SCAN), which uses attention mechanisms and a new similarity measurement to improve feature representation and matching, achieving state-of-the-art performance on datasets like iLIDS-VID, PRID2011, and MARS.

Video person re-identification attracts much attention in recent years. It aims to match image sequences of pedestrians from different camera views. Previous approaches usually improve this task from three aspects, including a) selecting more discriminative frames, b) generating more informative temporal representations, and c) developing more effective distance metrics. To address the above issues, we present a novel and practical deep architecture for video person re-identification termed Self-and-Collaborative Attention Network (SCAN). It has several appealing properties. First, SCAN adopts non-parametric attention mechanism to refine the intra-sequence and inter-sequence feature representation of videos, and outputs self-and-collaborative feature representation for each video, making the discriminative frames aligned between the probe and gallery sequences.Second, beyond existing models, a generalized pairwise similarity measurement is proposed to calculate the similarity feature representations of video pairs, enabling computing the matching scores by the binary classifier. Third, a dense clip segmentation strategy is also introduced to generate rich probe-gallery pairs to optimize the model. Extensive experiments demonstrate the effectiveness of SCAN, which outperforms the best-performing baselines on iLIDS-VID, PRID2011 and MARS dataset, respectively.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes