AI LG Oct 21, 2015

Time-Sensitive Bayesian Information Aggregation for Crowdsourcing Systems

Matteo Venanzi, John Guiver, Pushmeet Kohli, Nick Jennings

arXiv:1510.06335v211.332 citations

Originality: Incremental advance

AI Analysis

The paper addresses the need for more reliable and efficient crowdsourcing systems by improving both aggregation accuracy and task-duration estimation. It is an incremental advance: it builds on existing Bayesian aggregation methods and adds a novel time-based component.

The paper tackles the problem of aggregating unreliable crowdsourced judgments and estimating task durations by introducing a time-sensitive Bayesian method called BCCTime, which uses worker completion times as indicators of judgment quality. The results show that BCCTime achieves up to 11% more accurate classifications and up to 100% more informative duration estimates compared to state-of-the-art methods on entity linking datasets.

Crowdsourcing systems commonly face the problem of aggregating multiple judgments provided by potentially unreliable workers. In addition, several aspects of the design of efficient crowdsourcing processes, such as defining workers' bonuses, fair prices and time limits of the tasks, involve knowledge of the likely duration of the task at hand. Bringing this together, in this work we introduce a new time-sensitive Bayesian aggregation method that simultaneously estimates a task's duration and obtains reliable aggregations of crowdsourced judgments. Our method, called BCCTime, builds on the key insight that the time taken by a worker to perform a task is an important indicator of the likely quality of the produced judgment. To capture this, BCCTime uses latent variables to represent the uncertainty about the workers' completion time, the tasks' duration and the workers' accuracy. To relate the quality of a judgment to the time a worker spends on a task, our model assumes that each task is completed within a latent time window within which all workers with a propensity to genuinely attempt the labeling task (i.e., non-spammers) are expected to submit their judgments. In contrast, workers with a lower propensity for valid labeling, such as spammers, bots or lazy labelers, are assumed to perform tasks considerably faster or slower than the time required by normal workers. Specifically, we use efficient message-passing Bayesian inference to learn approximate posterior probabilities of (i) the confusion matrix of each worker, (ii) the propensity for valid labeling of each worker, (iii) the unbiased duration of each task and (iv) the true label of each task. Using two real-world public datasets for entity linking tasks, we show that BCCTime produces up to 11% more accurate classifications and up to 100% more informative estimates of a task's duration compared to state-of-the-art methods.
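
The time-window idea in the abstract can be illustrated with a small sketch. This is not the authors' message-passing inference: every parameter here (confusion matrices, propensities, the task's time window) is fixed by hand rather than learned, and the names `label_posterior` and `spam_profile` are hypothetical. The sketch also assumes, as a simplification, that a judgment submitted outside the window, or by a worker who is not genuinely attempting the task, is drawn from a label-independent profile and therefore carries no information about the true label.

```python
def label_posterior(judgments, prior, confusion, propensity, window, spam_profile):
    """Posterior over one task's true label under a simplified time-gated mixture.

    judgments:    list of (worker_id, label, seconds_taken)
    prior:        prior probability of each true label
    confusion:    confusion[worker][true_label][reported_label]
    propensity:   propensity[worker] = probability the worker labels genuinely
    window:       (shortest, longest) plausible completion time for this task
    spam_profile: label distribution assumed for non-genuine judgments
    All values are assumed known here; the paper infers them jointly.
    """
    shortest, longest = window
    scores = []
    for true_label, p_true in enumerate(prior):
        likelihood = p_true
        for worker, label, seconds in judgments:
            # A judgment can only count as genuine if its time is in the window.
            in_window = shortest <= seconds <= longest
            weight = propensity[worker] if in_window else 0.0
            likelihood *= (
                weight * confusion[worker][true_label][label]
                + (1.0 - weight) * spam_profile[label]
            )
        scores.append(likelihood)
    total = sum(scores)
    return [s / total for s in scores]


# Two equally accurate workers disagree; worker "b" answered in 2 seconds.
accurate = [[0.9, 0.1], [0.1, 0.9]]
posterior = label_posterior(
    judgments=[("a", 1, 30.0), ("b", 0, 2.0)],
    prior=[0.5, 0.5],
    confusion={"a": accurate, "b": accurate},
    propensity={"a": 0.9, "b": 0.9},
    window=(10.0, 120.0),
    spam_profile=[0.5, 0.5],
)
print([round(p, 3) for p in posterior])
```

In this toy run, worker "b" falls outside the assumed 10 to 120 second window, so that judgment is treated as uninformative and the posterior favors the label reported by worker "a". If both judgments fell inside the window, the two equally reliable workers would cancel each other out.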
