CVNov 20, 2018

Sequence-based Person Attribute Recognition with Joint CTC-Attention Model

arXiv:1811.08115v214 citations
AI Analysis

This work addresses attribute recognition for computer vision applications like person re-identification, but it appears incremental as it combines existing CTC and attention mechanisms.

The paper tackles person attribute recognition by proposing a joint CTC-Attention model that maps attribute labels into sequences to learn semantic relationships, and it demonstrates effectiveness through experiments on three public datasets.

Attribute recognition has become crucial because of its wide applications in many computer vision tasks, such as person re-identification. Like many object recognition problems, variations in viewpoints, illumination, and recognition at far distance, all make this task challenging. In this work, we propose a joint CTC-Attention model (JCM), which maps attribute labels into sequences to learn the semantic relationship among attributes. Besides, this network uses neural network to encode images into sequences, and employs connectionist temporal classification (CTC) loss to train the network with the aim of improving the encoding performance of the network. At the same time, it adopts the attention model to decode the sequences, which can realize aligning the sequences and better learning the semantic information from attributes. Extensive experiments on three public datasets, i.e., Market-1501 attribute dataset, Duke attribute dataset and PETA dataset, demonstrate the effectiveness of the proposed method.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes