CVLGJun 7, 2019

Visual Person Understanding through Multi-Task and Multi-Dataset Learning

arXiv:1906.03019v15 citations
Originality Incremental advance
AI Analysis

This addresses the need for holistic person understanding in applications like mobile robotics, though it is incremental as it builds on multi-task learning.

The paper tackles the problem of learning a single model for multiple person understanding tasks (re-identification, attribute classification, body part segmentation, and pose estimation) by combining datasets, and the result is a model that matches or outperforms single-task counterparts without significant computational overhead.

We address the problem of learning a single model for person re-identification, attribute classification, body part segmentation, and pose estimation. With predictions for these tasks we gain a more holistic understanding of persons, which is valuable for many applications. This is a classical multi-task learning problem. However, no dataset exists that these tasks could be jointly learned from. Hence several datasets need to be combined during training, which in other contexts has often led to reduced performance in the past. We extensively evaluate how the different task and datasets influence each other and how different degrees of parameter sharing between the tasks affect performance. Our final model matches or outperforms its single-task counterparts without creating significant computational overhead, rendering it highly interesting for resource-constrained scenarios such as mobile robotics.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes