LG AINov 28, 2025

A Unified and Stable Risk Minimization Framework for Weakly Supervised Learning with Theoretical Guarantees

Miao Zhang, Junpeng Li, Changchun Hua, Yana Yang

arXiv:2511.22823v1

Originality Highly original

AI Analysis

It addresses the problem of fragmented and unstable methods in weakly supervised learning for researchers and practitioners, offering a principled solution with theoretical guarantees.

The paper tackles the instability and lack of a unified approach in weakly supervised learning by proposing a stable surrogate risk framework that subsumes diverse settings like PU, UU, CLL, and PLL, achieving consistent gains across experiments without heuristic stabilization.

Weakly supervised learning has emerged as a practical alternative to fully supervised learning when complete and accurate labels are costly or infeasible to acquire. However, many existing methods are tailored to specific supervision patterns -- such as positive-unlabeled (PU), unlabeled-unlabeled (UU), complementary-label (CLL), partial-label (PLL), or similarity-unlabeled annotations -- and rely on post-hoc corrections to mitigate instability induced by indirect supervision. We propose a principled, unified framework that bypasses such post-hoc adjustments by directly formulating a stable surrogate risk grounded in the structure of weakly supervised data. The formulation naturally subsumes diverse settings -- including PU, UU, CLL, PLL, multi-class unlabeled, and tuple-based learning -- under a single optimization objective. We further establish a non-asymptotic generalization bound via Rademacher complexity that clarifies how supervision structure, model capacity, and sample size jointly govern performance. Beyond this, we analyze the effect of class-prior misspecification on the bound, deriving explicit terms that quantify its impact, and we study identifiability, giving sufficient conditions -- most notably via supervision stratification across groups -- under which the target risk is recoverable. Extensive experiments show consistent gains across class priors, dataset scales, and class counts -- without heuristic stabilization -- while exhibiting robustness to overfitting.

View on arXiv PDF

Similar