CVOct 5, 2017

Tracking Persons-of-Interest via Unsupervised Representation Adaptation

arXiv:1710.02139v133 citations
Originality Incremental advance
AI Analysis

This addresses the problem of tracking persons-of-interest in videos with large appearance variations, which is incremental as it adapts existing CNN methods to specific video contexts.

The paper tackles multi-face tracking in unconstrained videos by learning video-specific face representations using CNNs adapted with contextual constraints, achieving significant performance improvements over existing techniques on TV sitcoms and YouTube music videos.

Multi-face tracking in unconstrained videos is a challenging problem as faces of one person often appear drastically different in multiple shots due to significant variations in scale, pose, expression, illumination, and make-up. Existing multi-target tracking methods often use low-level features which are not sufficiently discriminative for identifying faces with such large appearance variations. In this paper, we tackle this problem by learning discriminative, video-specific face representations using convolutional neural networks (CNNs). Unlike existing CNN-based approaches which are only trained on large-scale face image datasets offline, we use the contextual constraints to generate a large number of training samples for a given video, and further adapt the pre-trained face CNN to specific videos using discovered training samples. Using these training samples, we optimize the embedding space so that the Euclidean distances correspond to a measure of semantic face similarity via minimizing a triplet loss function. With the learned discriminative features, we apply the hierarchical clustering algorithm to link tracklets across multiple shots to generate trajectories. We extensively evaluate the proposed algorithm on two sets of TV sitcoms and YouTube music videos, analyze the contribution of each component, and demonstrate significant performance improvement over existing techniques.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes