AI, CY, HC, LG · Jan 13, 2021

Understanding the Effect of Out-of-distribution Examples and Interactive Explanations on Human-AI Decision Making

arXiv:2101.05303v4 · 143 citations
Originality: Incremental advance
AI Analysis

This work addresses the challenge of achieving complementary performance in human-AI teams for prediction tasks, though it is incremental in exploring experimental setups and interfaces.

The study investigated how out-of-distribution examples and interactive explanations affect human-AI decision making. It found clear performance differences between in-distribution and out-of-distribution scenarios, and mixed results for interactive explanations: they improved the perceived usefulness of AI assistance but sometimes reinforced human biases.

Although AI holds promise for improving human decision making in societally critical domains, it remains an open question how human-AI teams can reliably outperform AI alone and human alone in challenging prediction tasks (also known as complementary performance). We explore two directions to understand the gaps in achieving complementary performance. First, we argue that the typical experimental setup limits the potential of human-AI teams. To account for lower AI performance out-of-distribution than in-distribution because of distribution shift, we design experiments with different distribution types and investigate human performance for both in-distribution and out-of-distribution examples. Second, we develop novel interfaces to support interactive explanations so that humans can actively engage with AI assistance. Using virtual pilot studies and large-scale randomized experiments across three tasks, we demonstrate a clear difference between in-distribution and out-of-distribution, and observe mixed results for interactive explanations: while interactive explanations improve human perception of AI assistance's usefulness, they may reinforce human biases and lead to limited performance improvement. Overall, our work points out critical challenges and future directions towards enhancing human performance with AI assistance.

