HCAIJan 25, 2024

Underspecified Human Decision Experiments Considered Harmful

arXiv:2401.15106v65 citationsCHI
Originality Synthesis-oriented
AI Analysis

This work addresses methodological flaws in human-AI collaboration and data visualization research, which is incremental as it refines existing standards rather than introducing a new paradigm.

The paper tackles the problem of imprecise definitions in human decision experiments, particularly in AI-assisted decision-making, by proposing a widely applicable definition of decision problems and evaluating recent studies; they found that only 26% of 39 studies claiming to identify biased behavior provided sufficient information to support such claims.

Decision-making with information displays is a key focus of research in areas like human-AI collaboration and data visualization. However, what constitutes a decision problem, and what is required for an experiment to conclude that decisions are flawed, remain imprecise. We present a widely applicable definition of a decision problem synthesized from statistical decision theory and information economics. We claim that to attribute loss in human performance to bias, an experiment must provide the information that a rational agent would need to identify the normative decision. We evaluate whether recent empirical research on AI-assisted decisions achieves this standard. We find that only 10 (26%) of 39 studies that claim to identify biased behavior presented participants with sufficient information to make this claim in at least one treatment condition. We motivate the value of studying well-defined decision problems by describing a characterization of performance losses they allow to be conceived.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes