SY LGMar 6, 2025

AOLO: Analysis and Optimization For Low-Carbon Oriented Wireless Large Language Model Services

Xiaoqi Wang, Hongyang Du, Yuehong Gao, Dong In Kim

arXiv:2503.04418v11.2h-index: 44

Originality Incremental advance

AI Analysis

This addresses the growing concern of energy consumption and carbon emissions in LLM services, offering a domain-specific optimization for more sustainable deployment.

The paper tackles the environmental impact of large language model services by proposing AOLO, a framework that analyzes and optimizes carbon footprint across computational inference and wireless communication, achieving an 18.77% reduction in overall carbon footprint compared to a benchmark method.

Recent advancements in large language models (LLMs) have led to their widespread adoption and large-scale deployment across various domains. However, their environmental impact, particularly during inference, has become a growing concern due to their substantial energy consumption and carbon footprint. Existing research has focused on inference computation alone, overlooking the analysis and optimization of carbon footprint in network-aided LLM service systems. To address this gap, we propose AOLO, a framework for analysis and optimization for low-carbon oriented wireless LLM services. AOLO introduces a comprehensive carbon footprint model that quantifies greenhouse gas emissions across the entire LLM service chain, including computational inference and wireless communication. Furthermore, we formulate an optimization problem aimed at minimizing the overall carbon footprint, which is solved through joint optimization of inference outputs and transmit power under quality-of-experience and system performance constraints. To achieve this joint optimization, we leverage the energy efficiency of spiking neural networks (SNNs) by adopting SNN as the actor network and propose a low-carbon-oriented optimization algorithm, i.e., SNN-based deep reinforcement learning (SDRL). Comprehensive simulations demonstrate that SDRL algorithm significantly reduces overall carbon footprint, achieving an 18.77% reduction compared to the benchmark soft actor-critic, highlighting its potential for enabling more sustainable LLM inference services.

View on arXiv PDF

Similar