CLMay 24, 2022

VIRATrustData: A Trust-Annotated Corpus of Human-Chatbot Conversations About COVID-19 Vaccines

Roni Friedman, João Sedoc, Shai Gretz, Assaf Toledo, Rose Weeks, Naor Bar-Zeev, Yoav Katz, Noam Slonim

arXiv:2205.12240v10.33 citationsh-index: 39

Originality Synthesis-oriented

AI Analysis

This work addresses the challenge of trust classification in medical chatbots for public health applications, but it is incremental as it focuses on dataset creation and initial benchmarking.

The authors tackled the problem of classifying trust levels in human-chatbot conversations about COVID-19 vaccines by annotating 3,000 conversational turns for trust categories and releasing the VIRATrustData dataset, demonstrating the task's non-trivial nature through model comparisons.

Public trust in medical information is crucial for successful application of public health policies such as vaccine uptake. This is especially true when the information is offered remotely, by chatbots, which have become increasingly popular in recent years. Here, we explore the challenging task of human-bot turn-level trust classification. We rely on a recently released data of observationally-collected (rather than crowdsourced) dialogs with VIRA chatbot, a COVID-19 Vaccine Information Resource Assistant. These dialogs are centered around questions and concerns about COVID-19 vaccines, where trust is particularly acute. We annotated $3k$ VIRA system-user conversational turns for Low Institutional Trust or Low Agent Trust vs. Neutral or High Trust. We release the labeled dataset, VIRATrustData, the first of its kind to the best of our knowledge. We demonstrate how this task is non-trivial and compare several models that predict the different levels of trust.

View on arXiv PDF

Similar