HCLGSep 4, 2020

Leveraging Clickstream Trajectories to Reveal Low-Quality Workers in Crowdsourced Forecasting Platforms

arXiv:2009.01966v1
Originality Synthesis-oriented
AI Analysis

This addresses the issue of data validity in crowdsourced platforms, particularly for sensitive applications like geopolitical forecasting, but is incremental as it applies existing clustering methods to a new domain.

The study tackled the problem of low-quality workers in crowdsourced forecasting by proposing a computational framework using clickstream trajectories to identify clusters of underperformers, such as those with inaccurate forecasts, poor explanations, or copy-pasting behavior.

Crowdwork often entails tackling cognitively-demanding and time-consuming tasks. Crowdsourcing can be used for complex annotation tasks, from medical imaging to geospatial data, and such data powers sensitive applications, such as health diagnostics or autonomous driving. However, the existence and prevalence of underperforming crowdworkers is well-recognized, and can pose a threat to the validity of crowdsourcing. In this study, we propose the use of a computational framework to identify clusters of underperforming workers using clickstream trajectories. We focus on crowdsourced geopolitical forecasting. The framework can reveal different types of underperformers, such as workers with forecasts whose accuracy is far from the consensus of the crowd, those who provide low-quality explanations for their forecasts, and those who simply copy-paste their forecasts from other users. Our study suggests that clickstream clustering and analysis are fundamental tools to diagnose the performance of crowdworkers in platforms leveraging the wisdom of crowds.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes