Edge Large AI Models: Revolutionizing 6G Networks
This work addresses the problem of enabling real-time intelligent services in 6G networks for telecommunications and AI developers, though it appears incremental as it builds on existing edge AI concepts with a focus on scaling to larger models.
The paper tackles the challenge of deploying large AI models on resource-constrained edge devices for 6G networks by proposing collaborative fine-tuning, full-parameter training frameworks, and a microservice-assisted inference architecture, with applications in channel prediction and beamforming to enhance wireless network performance.
Large artificial intelligence models (LAMs) possess human-like abilities to solve a wide range of real-world problems, exemplifying the potential of experts in various domains and modalities. By leveraging the communication and computation capabilities of geographically dispersed edge devices, edge LAM emerges as an enabling technology to empower the delivery of various real-time intelligent services in 6G. Unlike traditional edge artificial intelligence (AI) that primarily supports a single task using small models, edge LAM is featured by the need of the decomposition and distributed deployment of large models, and the ability to support highly generalized and diverse tasks. However, due to limited communication, computation, and storage resources over wireless networks, the vast number of trainable neurons and the substantial communication overhead pose a formidable hurdle to the practical deployment of edge LAMs. In this paper, we investigate the opportunities and challenges of edge LAMs from the perspectives of model decomposition and resource management. Specifically, we propose collaborative fine-tuning and full-parameter training frameworks, alongside a microservice-assisted inference architecture, to enhance the deployment of edge LAM over wireless networks. Additionally, we investigate the application of edge LAM in air-interface designs, focusing on channel prediction and beamforming. These innovative frameworks and applications offer valuable insights and solutions for advancing 6G technology.