DCMar 24

WWW.Serve: Interconnecting Global LLM Services through Decentralization

arXiv:2603.2066165.1h-index: 9
Predicted impact top 15% in DC · last 90 daysOriginality Incremental advance
AI Analysis

This addresses scalability bottlenecks for global LLM service providers by enabling decentralized resource allocation, though it builds incrementally on existing decentralization concepts.

The paper tackles the scalability and underutilization issues of centralized LLM services by proposing WWW.Serve, a decentralized framework that improves global service-level-objective attainment by up to 1.5x and reduces latency by 27.6%.

Large language model (LLM) services are mostly centralized, leading to scalability bottlenecks and underutilization of substantial scattered GPU resources. While decentralization offers a promising alternative, existing frameworks primarily focus on cooperation among GPU providers while overlooking their inherent competitive dynamics, imposing substantial constraints such as excessive platform-level oversight or rigid requirements to execute all assigned requests using fixed software stacks on fixed hardware configurations. We argue that such assumptions are unrealistic in real-world decentralized environments. To this end, we propose WWW$.$Serve, a decentralized framework for interconnecting LLM services worldwide. It allows participants to flexibly determine their participation policies and resource commitments, and supports self-organizing request dispatch, enabling the network to autonomously allocate requests without centralized coordination. Empirically, we show that WWW$.$Serve improves global SLO (service-level-objective) attainment by up to 1.5x and lowers latency by 27.6%. Its performance approaches, and in some cases surpasses, centralized scheduling, while fully preserving the benefits of decentralization. These results highlight WWW$.$Serve as a promising foundation for real-world, decentralized LLM serving.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes