AIMay 13

SPIN: Structural LLM Planning via Iterative Navigation for Industrial Tasks

arXiv:2605.1405165.2

Predicted impact top 55% in AI · last 90 daysOriginality Incremental advance

AI Analysis

For industrial LLM agent systems, SPIN reduces structural errors and unnecessary tool costs via DAG-based planning and prefix execution control.

SPIN reduces executed tasks by 41% and improves task accomplishment from 0.638 to 0.706 on AssetOpsBench, while cutting tool calls per run from 11.81 to 6.82.

Industrial LLM agent systems often separate planning from execution, yet LLM planners frequently produce structurally invalid or unnecessarily long workflows, leading to brittle failures and avoidable tool and API cost. We propose \texttt{SPIN}, a planning wrapper that combines validated Directed Acyclic Graph (DAG) planning with prefix based execution control. \texttt{SPIN} enforces a strict DAG contract through \texttt{\_validate\_plan\_text} and repair prompting, producing executable plans before downstream execution, and then evaluates DAG prefixes incrementally to stop when the current prefix is sufficient to answer the query. On AssetOpsBench, across 261 scenarios, \texttt{SPIN} reduces executed tasks from 1061 to 623 and improves \emph{Accomplished} from 0.638 to 0.706, while reducing tool calls from 11.81 to 6.82 per run. On MCP Bench, the same wrapper improves planning, grounding, and dependency related scores for both GPT OSS1 and Llama 4 Maverick.

View on arXiv PDF

Similar