HCFeb 28, 2017

Probabilistic Perspectives on Collecting Human Uncertainty in Predictive Data Mining

arXiv:1702.08826v15 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of modeling human uncertainty for data mining practitioners, but it is incremental as it builds on existing work in human-computer interaction.

The paper tackles the problem of human uncertainty in data mining, where users' volatile responses on ordinal scales affect algorithm performance, and finds that significant numbers of users submit actions different from their cognition, complicating statistical algorithm assessment.

In many areas of data mining, data is collected from humans beings. In this contribution, we ask the question of how people actually respond to ordinal scales. The main problem observed is that users tend to be volatile in their choices, i.e. complex cognitions do not always lead to the same decisions, but to distributions of possible decision outputs. This human uncertainty may sometimes have quite an impact on common data mining approaches and thus, the question of effective modelling this so called human uncertainty emerges naturally. Our contribution introduces two different approaches for modelling the human uncertainty of user responses. In doing so, we develop techniques in order to measure this uncertainty at the level of user inputs as well as the level of user cognition. With support of comprehensive user experiments and large-scale simulations, we systematically compare both methodologies along with their implications for personalisation approaches. Our findings demonstrate that significant amounts of users do submit something completely different (action) than they really have in mind (cognition). Moreover, we demonstrate that statistically sound evidence with respect to algorithm assessment becomes quite hard to realise, especially when explicit rankings shall be built.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes