DCLGApr 8

DynLP: Parallel Dynamic Batch Update for Label Propagation in Semi-Supervised Learning

arXiv:2604.065965.0h-index: 24
Predicted impact top 92% in DC · last 90 daysOriginality Incremental advance
AI Analysis

This addresses a computational bottleneck for researchers and practitioners handling streaming or incremental data in semi-supervised learning, though it is incremental as it optimizes an existing method rather than introducing a new paradigm.

The paper tackles the inefficiency of recomputing all labels for incremental batch updates in graph-based semi-supervised learning by proposing DynLP, a GPU-centric dynamic batched parallel algorithm that achieves an average 13x and up to 102x speedup on large-scale datasets compared to state-of-the-art methods.

Semi-supervised learning aims to infer class labels using only a small fraction of labeled data. In graph-based semi-supervised learning, this is typically achieved through label propagation to predict labels of unlabeled nodes. However, in real-world applications, data often arrive incrementally in batches. Each time a new batch appears, reapplying the traditional label propagation algorithm to recompute all labels is redundant, computationally intensive, and inefficient. To address the absence of an efficient label propagation update method, we propose DynLP, a novel GPU-centric Dynamic Batched Parallel Label Propagation algorithm that performs only the necessary updates, propagating changes to the relevant subgraph without requiring full recalculation. By exploiting GPU architectural optimizations, our algorithm achieves on average 13x and upto 102x speedup on large-scale datasets compared to state-of-the-art approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes