MLAIHCLGDec 31, 2024

Efficient Human-in-the-Loop Active Learning: A Novel Framework for Data Labeling in AI Systems

arXiv:2501.00277v11 citationsh-index: 1
Originality Incremental advance
AI Analysis

This work addresses the challenge of efficiently using expert time for data labeling in domains like radiology, though it appears incremental as it builds on existing active learning methods.

The paper tackles the problem of costly data labeling in AI by proposing a novel human-in-the-loop active learning framework that integrates different query schemes and a data-driven exploration-exploitation approach, resulting in higher accuracy and lower loss compared to other methods on five real-world datasets.

Modern AI algorithms require labeled data. In real world, majority of data are unlabeled. Labeling the data are costly. this is particularly true for some areas requiring special skills, such as reading radiology images by physicians. To most efficiently use expert's time for the data labeling, one promising approach is human-in-the-loop active learning algorithm. In this work, we propose a novel active learning framework with significant potential for application in modern AI systems. Unlike the traditional active learning methods, which only focus on determining which data point should be labeled, our framework also introduces an innovative perspective on incorporating different query scheme. We propose a model to integrate the information from different types of queries. Based on this model, our active learning frame can automatically determine how the next question is queried. We further developed a data driven exploration and exploitation framework into our active learning method. This method can be embedded in numerous active learning algorithms. Through simulations on five real-world datasets, including a highly complex real image task, our proposed active learning framework exhibits higher accuracy and lower loss compared to other methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes