From Prompts to Pavement: LMMs-based Agentic Behavior-Tree Generation Framework for Autonomous Vehicles

Omar Y. Goba, Ahmed Y. Gado, Catherine M. Elias, Ahmed Hussein

arXiv:2601.123581 citationsh-index: 1

AI Analysis

For autonomous vehicle researchers, this work provides a proof-of-concept for using large models to automate behavior tree generation, though it is incremental as it combines existing techniques without major breakthroughs.

This paper introduces an agentic framework that uses LLMs and LVMs to dynamically generate and adapt behavior trees for autonomous vehicles, enabling successful navigation around unexpected obstacles without human intervention in simulation.

Autonomous vehicles (AVs) require adaptive behavior planners to navigate unpredictable, real-world environments safely. Traditional behavior trees (BTs) offer structured decision logic but are inherently static and demand labor-intensive manual tuning, limiting their applicability at SAE Level 5 autonomy. This paper presents an agentic framework that leverages large language models (LLMs) and multi-modal vision models (LVMs) to generate and adapt BTs on the fly. A specialized Descriptor agent applies chain-of-symbols prompting to assess scene criticality, a Planner agent constructs high-level sub-goals via in-context learning, and a Generator agent synthesizes executable BT sub-trees in XML format. Integrated into a CARLA+Nav2 simulation, our system triggers only upon baseline BT failure, demonstrating successful navigation around unexpected obstacles (e.g., street blockage) with no human intervention. Compared to a static BT baseline, this approach is a proof-of-concept that extends to diverse driving scenarios.

View on arXiv PDF

Similar