Method Drift›LLM reasoning / chain-of-thought

Tracked

OLIVIA

OLIVIA: Online Learning via Inference-time Action Adaptation for Decision Making in LLM ReAct Agents

LLM reasoning / chain-of-thought · first seen May 11, 2026

current frontier — recent, not yet superseded in the knowledge base

0 papers critique it · 0 beat it on benchmarks

Newer alternatives

Recent methods in the same sub-problem, not yet superseded in the knowledge base.

OLIVIA OLIVIA: Online Learning via Inference-time Action Adaptation for Decision Making in LLM ReAct Agents
May 11, 2026
Planner-centric Plan-Execute paradigm Beyond ReAct: A Planner-Centric Framework for Complex Tool-Augmented LLM Reasoning
Nov 13, 2025
SR^2 Selection, Reflection and Self-Refinement: Revisit Reasoning Tasks via a Causal Lens
Oct 9, 2025
DS-STAR DS-STAR: Data Science Agent via Iterative Planning and Verification
Sep 26, 2025