CLAILGOct 16, 2025

LLMs as Scalable, General-Purpose Simulators For Evolving Digital Agent Training

Meta AI
arXiv:2510.14969v18 citationsh-index: 7Has Code
Originality Highly original
AI Analysis

This addresses the challenge of data scarcity for digital agent training, offering a scalable solution that reduces reliance on costly human annotation.

The paper tackles the problem of expensive data collection for training digital agents by introducing UI-Simulator, a scalable paradigm that generates synthetic UI trajectories, achieving performance rivaling or surpassing agents trained on real UIs in experiments on WebArena and AndroidWorld with better robustness.

Digital agents require diverse, large-scale UI trajectories to generalize across real-world tasks, yet collecting such data is prohibitively expensive in both human annotation, infra and engineering perspectives. To this end, we introduce $\textbf{UI-Simulator}$, a scalable paradigm that generates structured UI states and transitions to synthesize training trajectories at scale. Our paradigm integrates a digital world simulator for diverse UI states, a guided rollout process for coherent exploration, and a trajectory wrapper that produces high-quality and diverse trajectories for agent training. We further propose $\textbf{UI-Simulator-Grow}$, a targeted scaling strategy that enables more rapid and data-efficient scaling by prioritizing high-impact tasks and synthesizes informative trajectory variants. Experiments on WebArena and AndroidWorld show that UI-Simulator rivals or surpasses open-source agents trained on real UIs with significantly better robustness, despite using weaker teacher models. Moreover, UI-Simulator-Grow matches the performance of Llama-3-70B-Instruct using only Llama-3-8B-Instruct as the base model, highlighting the potential of targeted synthesis scaling paradigm to continuously and efficiently enhance the digital agents.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes