CVJan 4, 2016

Multi-task CNN Model for Attribute Prediction

Abrar H. Abdulnabi, Gang Wang, Jiwen Lu, Kui Jia

arXiv:1601.00400v1308 citations

Originality Synthesis-oriented

AI Analysis

This work addresses attribute prediction in computer vision, but it is incremental as it builds on existing multi-task learning and CNN methods.

The paper tackles the problem of predicting binary semantic attributes in images by proposing a multi-task CNN model that shares visual knowledge across attributes, and demonstrates its effectiveness on two popular datasets.

This paper proposes a joint multi-task learning algorithm to better predict attributes in images using deep convolutional neural networks (CNN). We consider learning binary semantic attributes through a multi-task CNN model, where each CNN will predict one binary attribute. The multi-task learning allows CNN models to simultaneously share visual knowledge among different attribute categories. Each CNN will generate attribute-specific feature representations, and then we apply multi-task learning on the features to predict their attributes. In our multi-task framework, we propose a method to decompose the overall model's parameters into a latent task matrix and combination matrix. Furthermore, under-sampled classifiers can leverage shared statistics from other classifiers to improve their performance. Natural grouping of attributes is applied such that attributes in the same group are encouraged to share more knowledge. Meanwhile, attributes in different groups will generally compete with each other, and consequently share less knowledge. We show the effectiveness of our method on two popular attribute datasets.

View on arXiv PDF

Similar