LGAIFeb 2, 2024

A General Framework for Learning from Weak Supervision

Peking U
arXiv:2402.01922v315 citationsh-index: 23ICML
Originality Highly original
AI Analysis

This work addresses scalability and versatility issues in weakly supervised learning, which is important for practical deployment in machine learning applications.

The paper tackles the challenges of weakly supervised learning by introducing a general framework (GLWS) that accommodates various weak supervision sources and reduces computational complexity from quadratic/factorial to linear scale, demonstrating superior performance across 11 scenarios.

Weakly supervised learning generally faces challenges in applicability to various scenarios with diverse weak supervision and in scalability due to the complexity of existing algorithms, thereby hindering the practical deployment. This paper introduces a general framework for learning from weak supervision (GLWS) with a novel algorithm. Central to GLWS is an Expectation-Maximization (EM) formulation, adeptly accommodating various weak supervision sources, including instance partial labels, aggregate statistics, pairwise observations, and unlabeled data. We further present an advanced algorithm that significantly simplifies the EM computational demands using a Non-deterministic Finite Automaton (NFA) along with a forward-backward algorithm, which effectively reduces time complexity from quadratic or factorial often required in existing solutions to linear scale. The problem of learning from arbitrary weak supervision is therefore converted to the NFA modeling of them. GLWS not only enhances the scalability of machine learning models but also demonstrates superior performance and versatility across 11 weak supervision scenarios. We hope our work paves the way for further advancements and practical deployment in this field.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes