CLSep 26, 2025

Taxonomy of Comprehensive Safety for Clinical Agents

Jean Seo, Hyunkyung Lee, Gibaeg Kim, Wooseok Han, Jaehyo Yoo, Seungseop Lim, Kihun Shin, Eunho Yang

arXiv:2509.22041v31 citationsh-index: 4EMNLP

Originality Synthesis-oriented

AI Analysis

This work addresses safety concerns for clinical chatbot users, but it is incremental as it builds on existing methods with a specialized taxonomy.

The paper tackled the problem of inadequate safety in clinical chatbots by introducing TACOS, a 21-class taxonomy for safety filtering and tool selection, and validated it with a curated dataset and experiments that provided insights into data distribution and model knowledge.

Safety is a paramount concern in clinical chatbot applications, where inaccurate or harmful responses can lead to serious consequences. Existing methods--such as guardrails and tool calling--often fall short in addressing the nuanced demands of the clinical domain. In this paper, we introduce TACOS (TAxonomy of COmprehensive Safety for Clinical Agents), a fine-grained, 21-class taxonomy that integrates safety filtering and tool selection into a single user intent classification step. TACOS is a taxonomy that can cover a wide spectrum of clinical and non-clinical queries, explicitly modeling varying safety thresholds and external tool dependencies. To validate our taxonomy, we curate a TACOS-annotated dataset and perform extensive experiments. Our results demonstrate the value of a new taxonomy specialized for clinical agent settings, and reveal useful insights about train data distribution and pretrained knowledge of base models.

View on arXiv PDF

Similar