LG AIFeb 2, 2024

A General Framework for Learning from Weak Supervision

Hao Chen, Jindong Wang, Lei Feng, Xiang Li, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj

Peking U

arXiv:2402.01922v315.715 citationsh-index: 23Has CodeICML

Originality Highly original

AI Analysis

This work addresses scalability and versatility issues in weakly supervised learning, which is important for practical deployment in machine learning applications.

The paper tackles the challenges of weakly supervised learning by introducing a general framework (GLWS) that accommodates various weak supervision sources and reduces computational complexity from quadratic/factorial to linear scale, demonstrating superior performance across 11 scenarios.

Weakly supervised learning generally faces challenges in applicability to various scenarios with diverse weak supervision and in scalability due to the complexity of existing algorithms, thereby hindering the practical deployment. This paper introduces a general framework for learning from weak supervision (GLWS) with a novel algorithm. Central to GLWS is an Expectation-Maximization (EM) formulation, adeptly accommodating various weak supervision sources, including instance partial labels, aggregate statistics, pairwise observations, and unlabeled data. We further present an advanced algorithm that significantly simplifies the EM computational demands using a Non-deterministic Finite Automaton (NFA) along with a forward-backward algorithm, which effectively reduces time complexity from quadratic or factorial often required in existing solutions to linear scale. The problem of learning from arbitrary weak supervision is therefore converted to the NFA modeling of them. GLWS not only enhances the scalability of machine learning models but also demonstrates superior performance and versatility across 11 weak supervision scenarios. We hope our work paves the way for further advancements and practical deployment in this field.

View on arXiv PDF Code

Similar