LGNov 26, 2024

Multi-Label Bayesian Active Learning with Inter-Label Relationships

arXiv:2411.17941v3h-index: 4UAI
Originality Incremental advance
AI Analysis

This work addresses multi-label active learning for scenarios with label dependencies and imbalanced data, representing an incremental improvement over existing methods.

The paper tackles the challenge of multi-label active learning by addressing label correlations and data imbalances, proposing a method that uses updated correlation matrices and ensemble pseudo labeling to achieve superior performance on four realistic datasets.

The primary challenge of multi-label active learning, differing it from multi-class active learning, lies in assessing the informativeness of an indefinite number of labels while also accounting for the inherited label correlation. Existing studies either require substantial computational resources to leverage correlations or fail to fully explore label dependencies. Additionally, real-world scenarios often require addressing intrinsic biases stemming from imbalanced data distributions. In this paper, we propose a new multi-label active learning strategy to address both challenges. Our method incorporates progressively updated positive and negative correlation matrices to capture co-occurrence and disjoint relationships within the label space of annotated samples, enabling a holistic assessment of uncertainty rather than treating labels as isolated elements. Furthermore, alongside diversity, our model employs ensemble pseudo labeling and beta scoring rules to address data imbalances. Extensive experiments on four realistic datasets demonstrate that our strategy consistently achieves more reliable and superior performance, compared to several established methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes