CVDec 12, 2020

Teacher-Student Asynchronous Learning with Multi-Source Consistency for Facial Landmark Detection

arXiv:2012.06711v1
AI Analysis

This work provides an incremental improvement in semi-supervised facial landmark detection for computer vision researchers and practitioners, by proposing a new teacher-student learning framework.

This paper introduces a teacher-student asynchronous learning (TSAL) framework for semi-supervised facial landmark detection in videos, addressing the high annotation cost. The TSAL framework implicitly mines pseudo-labels through consistency constraints between a radical student model, updated with multi-source supervision, and a calm teacher model, updated with single-source supervision and recursive average filtering. The framework achieves state-of-the-art performance on 300W, AFLW, and 300VW benchmarks.

Due to the high annotation cost of large-scale facial landmark detection tasks in videos, a semi-supervised paradigm that uses self-training for mining high-quality pseudo-labels to participate in training has been proposed by researchers. However, self-training based methods often train with a gradually increasing number of samples, whose performances vary a lot depending on the number of pseudo-labeled samples added. In this paper, we propose a teacher-student asynchronous learning~(TSAL) framework based on the multi-source supervision signal consistency criterion, which implicitly mines pseudo-labels through consistency constraints. Specifically, the TSAL framework contains two models with exactly the same structure. The radical student uses multi-source supervision signals from the same task to update parameters, while the calm teacher uses a single-source supervision signal to update parameters. In order to reasonably absorb student's suggestions, teacher's parameters are updated again through recursive average filtering. The experimental results prove that asynchronous-learning framework can effectively filter noise in multi-source supervision signals, thereby mining the pseudo-labels which are more significant for network parameter updating. And extensive experiments on 300W, AFLW, and 300VW benchmarks show that the TSAL framework achieves state-of-the-art performance.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes