Is AGrail superseded?

AGrail (Tool use / function calling): superseded — cited as a baseline and beaten by newer methods. 1 paper(s) critique it, 0 beat it on benchmarks — #31 of 55 most-superseded. Sub-problem: cluster led by AgentAuditor. Newer alternatives in the same sub-problem include ToolSafe.

Method Drift›Tool use / function calling

Superseded baseline#31 of 55 most-superseded

AGrail

AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection

Tool use / function calling · first seen Feb 17, 2025

superseded — cited as a baseline and beaten by newer methods

1 papers critique it · 0 beat it on benchmarks

What papers say

Verbatim critique sentences, each from a paper that cites AGrail as a baseline.

“ShieldAgent and AGrail produce safety judgments via complex reasoning and verification pipelines, incurring high latency that makes them impractical for step-level monitoring of tool invocation in LLM-based agents.”
— ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback

Newer alternatives

Recent methods in the same sub-problem, not yet superseded in the knowledge base.

ToolSafe ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback
Jan 15, 2026