Superseded baseline · #10 of 55 most-superseded
xLAM
xLAM: A Family of Large Action Models to Empower AI Agent Systems
Tool use / function calling · first seen Sep 5, 2024
superseded — cited as a baseline and beaten by newer methods
0 papers critique it · 2 beat it on benchmarks
Beaten on benchmarks
Head-to-head results where a newer method reports beating xLAM. Values are copied from the source paper's tables — verify against the cited paper.
- Disambiguation-Centric Finetuning Makes Enterprise Tool-Calling LLMs More Realistic and Less Risky
  Llama-3.3-DiaFORGE-70B beats xLAM · Acc [Llama-3.3-70B family]
  0.79 (Llama-3.3-DiaFORGE-70B) vs 0.51 (xLAM)
- ASA: Activation Steering for Tool-Calling Domain Adaptation
  ASA beats xLAM · Overall First Call Accuracy [NESTFUL evaluation]
  41.94 (ASA) vs 38.98 (xLAM)
Newer alternatives
Recent methods in the same sub-problem, not yet superseded in the knowledge base.