MAApr 16

FedGUI: Benchmarking Federated GUI Agents across Heterogeneous Platforms, Devices, and Operating Systems

Wenhao Wang, Haoting Shi, Mengying Yuan, Yiquan Lin, Panrong Tong, Hanzhang Zhou, Guangyi Liu, Pengxiang Zhao, Yue Wang, Siheng Chen

arXiv:2604.1495648.3h-index: 9Has Code

Predicted impact top 1% in MA · last 90 daysOriginality Incremental advance

AI Analysis

For researchers developing privacy-preserving GUI agents, this benchmark enables systematic study of cross-platform heterogeneity, filling a gap in existing federated learning benchmarks.

FedGUI introduces the first benchmark for federated GUI agents across mobile, web, and desktop platforms, showing that cross-platform collaboration improves performance and identifying platform and OS as key heterogeneity factors.

Training GUI agents with traditional centralized methods faces significant cost and scalability challenges. Federated learning (FL) offers a promising solution, yet its potential is hindered by the lack of benchmarks that capture real-world, cross-platform heterogeneity. To bridge this gap, we introduce FedGUI, the first comprehensive benchmark for developing and evaluating federated GUI agents across mobile, web, and desktop platforms. FedGUI provides a suite of six curated datasets to systematically study four crucial types of heterogeneity: cross-platform, cross-device, cross-OS, and cross-source. Extensive experiments reveal several key insights: First, we show that cross-platform collaboration improves performance, extending prior mobile-only federated learning to diverse GUI environments; Second, we demonstrate the presence of distinct heterogeneity dimensions and identify platform and OS as the most influential factors. FedGUI provides a vital foundation for the community to build more scalable and privacy-preserving GUI agents for real-world deployment. Our code and data are publicly available at https://github.com/wwh0411/FedGUI..

View on arXiv PDF Code

Similar