AI LG May 28, 2025

Reinforced Reasoning for Embodied Planning

Di Wu, Jiaxin Fan, Junzhe Zang, Guanbo Wang, Wei Yin, Wenhao Li, Bo Jin

arXiv:2505.22050v218.112 citations, h-index: 12, Has Code

Originality: Incremental advance

AI Analysis

This work addresses the challenge of long-horizon planning for embodied AI agents, offering an incremental improvement through reinforcement-driven reasoning.

The paper tackles embodied planning, where agents struggle with temporal reasoning and spatial understanding. It introduces a reinforcement fine-tuning framework that significantly outperforms GPT-4o-mini and 70B+ open-source baselines on the Embench benchmark and generalizes well to unseen environments.

Embodied planning requires agents to make coherent multi-step decisions based on dynamic visual observations and natural language goals. While recent vision-language models (VLMs) excel at static perception tasks, they struggle with the temporal reasoning, spatial understanding, and commonsense grounding needed for planning in interactive environments. In this work, we introduce a reinforcement fine-tuning framework that brings R1-style reasoning enhancement into embodied planning. We first distill a high-quality dataset from a powerful closed-source model and perform supervised fine-tuning (SFT) to equip the model with structured decision-making priors. We then design a rule-based reward function tailored to multi-step action quality and optimize the policy via Group Relative Policy Optimization (GRPO). Our approach is evaluated on Embench, a recent benchmark for interactive embodied tasks, covering both in-domain and out-of-domain scenarios. Experimental results show that our method significantly outperforms models of similar or larger scale, including GPT-4o-mini and 70B+ open-source baselines, and exhibits strong generalization to unseen environments. This work highlights the potential of reinforcement-driven reasoning to advance long-horizon planning in embodied AI.
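To make the reward-then-GRPO step in the abstract more concrete, here is a minimal sketch of how a rule-based reward over multi-step plans could feed GRPO-style group-relative advantages. The abstract does not specify the actual reward rules, so `prefix_match_reward`, the example plans, and the action strings are hypothetical illustrations rather than the authors' method. The advantage normalization (subtract the group mean, divide by the group standard deviation) follows the commonly described GRPO formulation; using the population standard deviation is an assumption.

```python
from statistics import mean, pstdev


def prefix_match_reward(predicted, reference):
    """Hypothetical rule-based reward (not from the paper): the fraction of
    the reference plan that the predicted plan matches, step by step, before
    the first mismatch."""
    if not reference:
        return 0.0
    matched = 0
    for pred_step, ref_step in zip(predicted, reference):
        if pred_step != ref_step:
            break
        matched += 1
    return matched / len(reference)


def group_relative_advantages(rewards, eps=1e-8):
    """GRPO-style advantages: normalize each sampled response's reward by the
    mean and standard deviation of its own group, so no learned value
    function is needed. Population std is an assumption of this sketch."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical reference plan and a group of sampled plans for one task.
reference = ["find apple", "pick up apple", "find fridge", "put apple in fridge"]
group = [
    ["find apple", "pick up apple", "find fridge", "put apple in fridge"],
    ["find apple", "pick up apple", "find table"],
    ["find fridge"],
]

rewards = [prefix_match_reward(plan, reference) for plan in group]
advantages = group_relative_advantages(rewards)

for plan, reward, adv in zip(group, rewards, advantages):
    print(f"reward={reward:.2f} advantage={adv:+.3f} steps={len(plan)}")
```

In a full training loop, these advantages would weight the policy-gradient update for each sampled plan, typically alongside a clipped probability ratio and a KL penalty toward a reference model; those parts are omitted here.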
