CVAug 26, 2020

Keypoint-Aligned Embeddings for Image Retrieval and Re-identification

arXiv:2008.11368v10.0031 citations
AI Analysis50

This addresses pose invariance for tasks like person, vehicle, or animal re-identification, offering a compact and generic solution with incremental improvements.

The paper tackles the problem of high intra-class variance in image retrieval and re-identification due to deformable shapes and varying viewpoints by proposing keypoint-aligned embeddings, achieving state-of-the-art performance on benchmark datasets like CUB-200-2011, Cars196, and VeRi-776.

Learning embeddings that are invariant to the pose of the object is crucial in visual image retrieval and re-identification. The existing approaches for person, vehicle, or animal re-identification tasks suffer from high intra-class variance due to deformable shapes and different camera viewpoints. To overcome this limitation, we propose to align the image embedding with a predefined order of the keypoints. The proposed keypoint aligned embeddings model (KAE-Net) learns part-level features via multi-task learning which is guided by keypoint locations. More specifically, KAE-Net extracts channels from a feature map activated by a specific keypoint through learning the auxiliary task of heatmap reconstruction for this keypoint. The KAE-Net is compact, generic and conceptually simple. It achieves state of the art performance on the benchmark datasets of CUB-200-2011, Cars196 and VeRi-776 for retrieval and re-identification tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes