CLSep 26, 2025

Taxonomy of Comprehensive Safety for Clinical Agents

arXiv:2509.22041v31 citationsh-index: 4EMNLP
Originality Synthesis-oriented
AI Analysis

This work addresses safety concerns for clinical chatbot users, but it is incremental as it builds on existing methods with a specialized taxonomy.

The paper tackled the problem of inadequate safety in clinical chatbots by introducing TACOS, a 21-class taxonomy for safety filtering and tool selection, and validated it with a curated dataset and experiments that provided insights into data distribution and model knowledge.

Safety is a paramount concern in clinical chatbot applications, where inaccurate or harmful responses can lead to serious consequences. Existing methods--such as guardrails and tool calling--often fall short in addressing the nuanced demands of the clinical domain. In this paper, we introduce TACOS (TAxonomy of COmprehensive Safety for Clinical Agents), a fine-grained, 21-class taxonomy that integrates safety filtering and tool selection into a single user intent classification step. TACOS is a taxonomy that can cover a wide spectrum of clinical and non-clinical queries, explicitly modeling varying safety thresholds and external tool dependencies. To validate our taxonomy, we curate a TACOS-annotated dataset and perform extensive experiments. Our results demonstrate the value of a new taxonomy specialized for clinical agent settings, and reveal useful insights about train data distribution and pretrained knowledge of base models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes