LG AIApr 24

Reliable Self-Harm Risk Screening via Adaptive Multi-Agent LLM Systems

arXiv:2604.2215421.9h-index: 2

AI Analysis

For safety-critical behavioral health screening, this work offers a principled method to improve precision without sacrificing recall, addressing reliability concerns in multi-agent LLM systems.

The paper introduces a statistical framework for multi-agent LLM pipelines that provides adaptive decision-making with regret guarantees, achieving a false positive rate of 0.095 on the AEGIS 2.0 dataset, reducing incorrect flagging of safe content by 40% compared to single-agent models while maintaining similar false negative rates.

Emerging AI systems in behavioral health and psychiatry use multi-step or multi-agent LLM pipelines for tasks like assessing self-harm risk and screening for depression. However, common evaluation approaches, like LLM-as-a-judge, do not indicate when a decision is reliable or how errors may accumulate across multiple LLM judgements, limiting their suitability for safety-critical settings. We present a statistical framework for multi-agent pipelines structured as directed acyclic graphs (DAGs) that provides an alternative to heuristic voting with principled, adaptive decision-making. We model each agent as a stochastic categorical decision and introduce (1) tighter agent-level performance confidence bounds, (2) a bandit-based adaptive sampling strategy based on input difficulty, and (3) regret guarantees over the multi-agent system that shows logarithmic error growth when deployed. We evaluate our system on two labeled datasets in behavioral health : the AEGIS 2.0 behavioral health subset (N=161) and a stratified sample of SWMH Reddit posts (N=250). Empirically, our adaptive sampling strategy achieves the lowest false positive rate of any condition across both datasets, 0.095 on AEGIS 2.0 compared to 0.159 for single-agent models, reducing incorrect flagging of safe content by 40\% and still having similar false negative rates across all conditions. These results suggest that principled adaptive sampling offers a meaningful improvement in precision without reducing recall in this setting.

View on arXiv PDF

Similar