CV, Oct 2, 2020

Semi-Supervised Learning for Multi-Task Scene Understanding by Neural Graph Consensus

arXiv:2010.01086v2, 11 citations
Originality: Incremental advance
AI Analysis

This paper addresses the problem of learning multiple visual tasks from limited labeled data. It is an incremental advance built on a graph-based consensus approach.

The paper tackles semi-supervised learning for multi-task scene understanding by using a graph of neural networks to find consensus among different interpretations of a scene. The authors report better performance than state-of-the-art multi-task and semi-supervised methods on a large dataset.

We address the challenging problem of semi-supervised learning in the context of multiple visual interpretations of the world by finding consensus in a graph of neural networks. Each graph node is a scene interpretation layer, while each edge is a deep net that transforms one layer at one node into another from a different node. During the supervised phase, edge networks are trained independently. During the next, unsupervised stage, edge nets are trained on the pseudo-ground truth provided by consensus among multiple paths that reach the nets' start and end nodes. These paths act as ensemble teachers for any given edge, and strong consensus is used as a high-confidence supervisory signal. The unsupervised learning process is repeated over several generations, in which each edge becomes a "student" and also part of different ensemble "teachers" for training other students. By optimizing such consensus between different paths, the graph reaches consistency and robustness over multiple interpretations and generations, in the face of unknown labels. We give theoretical justifications of the proposed idea and validate it on a large dataset. We show how prediction of different representations such as depth, semantic segmentation, surface normals and pose from RGB input can be effectively learned through self-supervised consensus in our graph. We also compare to state-of-the-art methods for multi-task and semi-supervised learning and show superior performance.
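
The consensus step described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation. It assumes that each edge is a plain function on a list of per-pixel values, that the pseudo-label is the per-pixel median over all paths reaching the target node, and that "strong consensus" means a small standard deviation across paths. The node names, the `max_spread` threshold, and the helper names are all hypothetical.

```python
from statistics import median, pstdev


def path_output(edges, path, x):
    """Apply the edge functions along `path` (a list of node names) to input x."""
    for src, dst in zip(path, path[1:]):
        x = edges[(src, dst)](x)
    return x


def consensus_pseudo_label(edges, paths, x, max_spread=0.1):
    """Return (pseudo_label, mask), one entry per element of x.

    pseudo_label[i] is the median of all path predictions for element i.
    mask[i] is True only where the paths agree closely (std <= max_spread);
    a student edge would be trained only on the masked-in elements.
    The median and the std threshold are assumptions made for this sketch.
    """
    preds = [path_output(edges, p, x) for p in paths]  # one prediction list per path
    label, mask = [], []
    for values in zip(*preds):
        label.append(median(values))
        mask.append(pstdev(values) <= max_spread)
    return label, mask


# Toy graph: stand-in functions replace the trained edge networks (hypothetical).
edges = {
    ("rgb", "depth"): lambda v: [2.0 * a for a in v],
    ("rgb", "normals"): lambda v: [a + 1.0 for a in v],
    ("normals", "depth"): lambda v: [2.0 * (a - 1.0) for a in v],
    ("rgb", "edges2d"): lambda v: [a * a for a in v],
    ("edges2d", "depth"): lambda v: [2.0 * a for a in v],
}

# Three paths that all start at "rgb" and end at "depth".
paths = [
    ["rgb", "depth"],
    ["rgb", "normals", "depth"],
    ["rgb", "edges2d", "depth"],
]

x = [1.0, 0.5, 3.0]
label, mask = consensus_pseudo_label(edges, paths, x)
print(label)  # [2.0, 1.0, 6.0]
print(mask)   # [True, False, False]
```

In this toy run all three paths agree on the first element, so only that element would supply a training signal. On the other two elements the third path disagrees and they are masked out. Repeating this for every edge, then retraining the edges on the masked pseudo-labels, would correspond to one "generation" in the abstract's terms.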

Code Implementations: 3 repos
