MLDCLGDec 15, 2016

Efficient Distributed Semi-Supervised Learning using Stochastic Regularization over Affinity Graphs

arXiv:1612.04898v2
AI Analysis

This work addresses the challenge of scalable semi-supervised learning for domains like speech processing, though it is incremental as it builds on existing graph-based methods.

The paper tackles the problem of training deep neural networks with limited labeled data by introducing a stochastic graph-regularization technique for efficient distributed semi-supervised learning, resulting in improved classification accuracy when labeled data is scarce and significant speed-up in convergence time for parallel training.

We describe a computationally efficient, stochastic graph-regularization technique that can be utilized for the semi-supervised training of deep neural networks in a parallel or distributed setting. We utilize a technique, first described in [13] for the construction of mini-batches for stochastic gradient descent (SGD) based on synthesized partitions of an affinity graph that are consistent with the graph structure, but also preserve enough stochasticity for convergence of SGD to good local minima. We show how our technique allows a graph-based semi-supervised loss function to be decomposed into a sum over objectives, facilitating data parallelism for scalable training of machine learning models. Empirical results indicate that our method significantly improves classification accuracy compared to the fully-supervised case when the fraction of labeled data is low, and in the parallel case, achieves significant speed-up in terms of wall-clock time to convergence. We show the results for both sequential and distributed-memory semi-supervised DNN training on a speech corpus.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes