Superseded baseline · #47 of 55 most-superseded
Search-R1
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Tool use / function calling · first seen Mar 12, 2025
superseded — cited as a baseline and beaten by newer methods
1 paper critiques it · 0 beat it on benchmarks
What papers say
Verbatim critique sentences, each from a paper that cites Search-R1 as a baseline.
“Because the reward is shared across all segments, the contribution of any single tool call is hard to isolate, and unnecessary tool calls on easy questions still receive positive reinforcement when the episode succeeds.”
— Knowing When to Ask: Segment-Level Credit Assignment for LLM Tool Use
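The critique above can be illustrated with a toy sketch. This is not Search-R1's training code: the `Segment` type, the `shared_outcome_credit` function, the segment labels, and the example episode are all hypothetical, chosen only to show what "the reward is shared across all segments" implies for an unnecessary tool call.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    kind: str  # hypothetical labels: "reason" or "tool_call"
    text: str


def shared_outcome_credit(segments, episode_reward):
    """Give every segment the same episode-level reward (assumed scheme)."""
    return [episode_reward for _ in segments]


# A hypothetical easy question where the search call was not needed.
easy_episode = [
    Segment("reason", "The question asks for the capital of France."),
    Segment("tool_call", "search('capital of France')"),
    Segment("reason", "The answer is Paris."),
]

# The episode succeeds, so the outcome reward is 1.0.
credits = shared_outcome_credit(easy_episode, episode_reward=1.0)

# Credit received by tool-call segments only.
tool_call_credits = [
    credit
    for segment, credit in zip(easy_episode, credits)
    if segment.kind == "tool_call"
]
```

Under this assumed scheme, the unnecessary search call receives the same positive credit as the reasoning segments, so nothing in the signal distinguishes a helpful call from a redundant one.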
Newer alternatives
Recent methods in the same sub-problem, not yet superseded in the knowledge base.