MAApr 16

FedGUI: Benchmarking Federated GUI Agents across Heterogeneous Platforms, Devices, and Operating Systems

arXiv:2604.1495648.3h-index: 9Has Code
Predicted impact top 1% in MA · last 90 daysOriginality Incremental advance
AI Analysis

For researchers developing privacy-preserving GUI agents, this benchmark enables systematic study of cross-platform heterogeneity, filling a gap in existing federated learning benchmarks.

FedGUI introduces the first benchmark for federated GUI agents across mobile, web, and desktop platforms, showing that cross-platform collaboration improves performance and identifying platform and OS as key heterogeneity factors.

Training GUI agents with traditional centralized methods faces significant cost and scalability challenges. Federated learning (FL) offers a promising solution, yet its potential is hindered by the lack of benchmarks that capture real-world, cross-platform heterogeneity. To bridge this gap, we introduce FedGUI, the first comprehensive benchmark for developing and evaluating federated GUI agents across mobile, web, and desktop platforms. FedGUI provides a suite of six curated datasets to systematically study four crucial types of heterogeneity: cross-platform, cross-device, cross-OS, and cross-source. Extensive experiments reveal several key insights: First, we show that cross-platform collaboration improves performance, extending prior mobile-only federated learning to diverse GUI environments; Second, we demonstrate the presence of distinct heterogeneity dimensions and identify platform and OS as the most influential factors. FedGUI provides a vital foundation for the community to build more scalable and privacy-preserving GUI agents for real-world deployment. Our code and data are publicly available at https://github.com/wwh0411/FedGUI..

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes