Method Drift›Tool use / function calling
Superseded baseline#26 of 55 most-superseded
SAGE
SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement LearningTool use / function calling · first seen Dec 15, 2025
superseded — cited as a baseline and beaten by newer methods
0 papers critique it · 1 beat it on benchmarks
Beaten on benchmarks
Head-to-head results where a newer method reports beating SAGE. Values are copied from the source paper's tables — verify against the cited paper.
- Exploring Agentic Tool-Calling Decisions via Uncertainty-Aligned Reinforcement Learning
Turn-level TRUSTR beats SAGE · Acc Norm [From Qwen3-4B-Thinking, Turn-level training]
80.83 vs 73.36
- Exploring Agentic Tool-Calling Decisions via Uncertainty-Aligned Reinforcement Learning
Turn-level TRUSTR beats SAGE · Overall Score [From Qwen3-4B-Thinking, Turn-level training]
48.04 vs 41.16
Newer alternatives
Recent methods in the same sub-problem, not yet superseded in the knowledge base.