CVLGOct 28, 2019

Shoestring: Graph-Based Semi-Supervised Learning with Severely Limited Labeled Data

arXiv:1910.12976v21 citations
AI Analysis

It addresses the problem of limited labeled data for researchers and practitioners in machine learning, offering an incremental improvement over existing methods.

The paper tackles graph-based semi-supervised learning with severely limited labeled data by proposing the Shoestring framework, which improves performance through semantic transfer from few labeled samples to many unlabeled ones, achieving state-of-the-art node classification and significant gains on image datasets like miniImageNet (2.57% to 3.59%) and tieredImageNet (1.05% to 2.70%).

Graph-based semi-supervised learning has been shown to be one of the most effective approaches for classification tasks from a wide range of domains, such as image classification and text classification, as they can exploit the connectivity patterns between labeled and unlabeled samples to improve learning performance. In this work, we advance this effective learning paradigm towards a scenario where labeled data are severely limited. More specifically, we address the problem of graph-based semi-supervised learning in the presence of severely limited labeled samples, and propose a new framework, called {\em Shoestring}, that improves the learning performance through semantic transfer from these very few labeled samples to large numbers of unlabeled samples. In particular, our framework learns a metric space in which classification can be performed by computing the similarity to centroid embedding of each class. {\em Shoestring} is trained in an end-to-end fashion to learn to leverage the semantic knowledge of limited labeled samples as well as their connectivity patterns with large numbers of unlabeled samples simultaneously. By combining {\em Shoestring} with graph convolutional networks, label propagation and their recent label-efficient variations (IGCN and GLP), we are able to achieve state-of-the-art node classification performance in the presence of very few labeled samples. In addition, we demonstrate the effectiveness of our framework on image classification tasks in the few-shot learning regime, with significant gains on miniImageNet ($2.57\%\sim3.59\%$) and tieredImageNet ($1.05\%\sim2.70\%$).

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes