Confidence-weighted integration of human and machine judgments for superior decision-making
This work addresses the challenge of leveraging human expertise alongside AI for better outcomes in decision-making, though it is incremental as it builds on existing Bayesian and logistic regression methods.
The paper tackled the problem of integrating human and machine judgments to improve decision-making, demonstrating that combining confidence-weighted judgments consistently enhanced team performance in image classification and neuroscience forecasting tasks.
Large language models (LLMs) can surpass humans in certain forecasting tasks. What role does this leave for humans in the overall decision process? One possibility is that humans, despite performing worse than LLMs, can still add value when teamed with them. A human and machine team can surpass each individual teammate when team members' confidence is well-calibrated and team members diverge in which tasks they find difficult (i.e., calibration and diversity are needed). We simplified and extended a Bayesian approach to combining judgments using a logistic regression framework that integrates confidence-weighted judgments for any number of team members. Using this straightforward method, we demonstrated its effectiveness in both image classification and neuroscience forecasting tasks. Combining human judgments with one or more machines consistently improved overall team performance. Our hope is that this simple and effective strategy for integrating the judgments of humans and machines will lead to productive collaborations.