CLApr 1

CARE: Privacy-Compliant Agentic Reasoning with Evidence Discordance

arXiv:2604.0111391.0
AI Analysis

This addresses the challenge of handling conflicting clinical evidence in high-stakes decision-making for healthcare settings, representing an incremental improvement with a novel method for a known bottleneck.

The paper tackles the problem of LLM systems performing poorly when evidence is internally inconsistent, such as in healthcare where patient-reported symptoms contradict medical signs, and introduces CARE, a multi-stage privacy-compliant agentic reasoning framework that achieves stronger performance across all key metrics compared to baselines.

Large language model (LLM) systems are increasingly used to support high-stakes decision-making, but they typically perform worse when the available evidence is internally inconsistent. Such a scenario exists in real-world healthcare settings, with patient-reported symptoms contradicting medical signs. To study this problem, we introduce MIMIC-DOS, a dataset for short-horizon organ dysfunction worsening prediction in the intensive care unit (ICU) setting. We derive this dataset from the widely recognized MIMIC-IV, a publicly available electronic health record dataset, and construct it exclusively from cases in which discordance between signs and symptoms exists. This setting poses a substantial challenge for existing LLM-based approaches, with single-pass LLMs and agentic pipelines often struggling to reconcile such conflicting signals. To address this problem, we propose CARE: a multi-stage privacy-compliant agentic reasoning framework in which a remote LLM provides guidance by generating structured categories and transitions without accessing sensitive patient data, while a local LLM uses these categories and transitions to support evidence acquisition and final decision-making. Empirically, CARE achieves stronger performance across all key metrics compared to multiple baseline settings, showing that CARE can more robustly handle conflicting clinical evidence while preserving privacy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes