CVAIMar 1, 2018

Multi-Instance Dynamic Ordinal Random Fields for Weakly-supervised Facial Behavior Analysis

arXiv:1803.00907v124 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of reducing data annotation costs for researchers and practitioners in facial behavior analysis, though it is incremental as it builds on existing multi-instance learning and graphical model techniques.

The paper tackles weakly-supervised facial behavior analysis by proposing Multi-Instance Dynamic Ordinal Random Fields (MI-DORF), which models instance labels as latent variables in an undirected graphical model to handle ordinal and temporal data. It outperforms alternative approaches on Facial Action Unit and Pain intensity estimation tasks using DISFA and UNBC datasets, demonstrating reduced annotation efforts.

We propose a Multi-Instance-Learning (MIL) approach for weakly-supervised learning problems, where a training set is formed by bags (sets of feature vectors or instances) and only labels at bag-level are provided. Specifically, we consider the Multi-Instance Dynamic-Ordinal-Regression (MI-DOR) setting, where the instance labels are naturally represented as ordinal variables and bags are structured as temporal sequences. To this end, we propose Multi-Instance Dynamic Ordinal Random Fields (MI-DORF). In this framework, we treat instance-labels as temporally-dependent latent variables in an Undirected Graphical Model. Different MIL assumptions are modelled via newly introduced high-order potentials relating bag and instance-labels within the energy function of the model. We also extend our framework to address the Partially-Observed MI-DOR problems, where a subset of instance labels are available during training. We show on the tasks of weakly-supervised facial behavior analysis, Facial Action Unit (DISFA dataset) and Pain (UNBC dataset) Intensity estimation, that the proposed framework outperforms alternative learning approaches. Furthermore, we show that MIDORF can be employed to reduce the data annotation efforts in this context by large-scale.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes