AICLCVGRLGOct 16, 2025

Agentic Design of Compositional Machines

arXiv:2510.14980v22 citationsh-index: 3Has Code
AI Analysis

This addresses the challenge of automating machine design for engineering and AI research, but it is incremental as it builds on existing LLM and simulation techniques.

The paper tackles the problem of using large language models (LLMs) to design complex machines from standardized components in a simulated environment, resulting in the introduction of the BesiegeField testbed and benchmarks showing current models fall short, with reinforcement learning explored for improvement.

The design of complex machines stands as both a marker of human intelligence and a foundation of engineering practice. Given recent advances in large language models (LLMs), we ask whether they, too, can learn to create. We approach this question through the lens of compositional machine design: a task in which machines are assembled from standardized components to meet functional demands like locomotion or manipulation in a simulated physical environment. With this simplification, machine design is expressed as writing XML-like code that explicitly specifies pairwise part connections. To support this investigation, we introduce BesiegeField, a testbed built on the machine-building game Besiege, which enables part-based construction, physical simulation and reward-driven evaluation. Using BesiegeField, we benchmark state-of-the-art LLMs with agentic workflows and identify key capabilities required for success, including spatial reasoning, strategic assembly, and instruction-following. As current open-source models fall short, we explore reinforcement learning (RL) as a path to improvement: we curate a cold-start dataset, conduct RL finetuning experiments, and highlight open challenges at the intersection of language, machine design, and physical reasoning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes