LGDCNov 2, 2016

Scalable Semi-Supervised Learning over Networks using Nonsmooth Convex Optimization

arXiv:1611.00714v11 citations
Originality Synthesis-oriented
AI Analysis

This addresses the challenge of handling massive network-structured data for semi-supervised learning, though it appears incremental as it builds on existing smoothness assumptions and optimization methods.

The paper tackles the problem of semi-supervised learning on large network datasets by formulating it as a nonsmooth convex optimization problem based on graph signal total variation, achieving scalability through a message-passing implementation in big data frameworks.

We propose a scalable method for semi-supervised (transductive) learning from massive network-structured datasets. Our approach to semi-supervised learning is based on representing the underlying hypothesis as a graph signal with small total variation. Requiring a small total variation of the graph signal representing the underlying hypothesis corresponds to the central smoothness assumption that forms the basis for semi-supervised learning, i.e., input points forming clusters have similar output values or labels. We formulate the learning problem as a nonsmooth convex optimization problem which we solve by appealing to Nesterovs optimal first-order method for nonsmooth optimization. We also provide a message passing formulation of the learning method which allows for a highly scalable implementation in big data frameworks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes