SYAISYApr 9

Networking-Aware Energy Efficiency in Agentic AI Inference: A Survey

arXiv:2604.0785730.3
Predicted impact top 1% in SY · last 90 daysOriginality Synthesis-oriented
AI Analysis

It addresses energy challenges for Agentic AI systems in mobile edge computing and wireless networks, but is incremental as it synthesizes existing knowledge into a survey.

This survey tackles the energy efficiency problem in Agentic AI inference by analyzing computational and communication costs, proposing a framework and taxonomy for optimization without presenting specific numerical results.

The rapid emergence of Large Language Models (LLMs) has catalyzed Agentic artificial intelligence (AI), autonomous systems integrating perception, reasoning, and action into closed-loop pipelines for continuous adaptation. While unlocking transformative applications in mobile edge computing, autonomous systems, and next-generation wireless networks, this paradigm creates fundamental energy challenges through iterative inference and persistent data exchange. Unlike traditional AI where bottlenecks are computational Floating Point Operations (FLOPs), Agentic AI faces compounding computational and communication energy costs. In this survey, we propose an energy accounting framework identifying computational and communication costs across the Perception-Reasoning-Action cycle. We establish a unified taxonomy spanning model simplification, computation control, input and attention optimization, and hardware-aware inference. We explore cross-layer co-design strategies jointly optimizing model parameters, wireless transmissions, and edge resources. Finally, we identify open challenges of federated green learning, carbon-aware agency, 6th generation mobile communication (6G)-native Agentic AI, and self-sustaining systems, providing a roadmap for scalable autonomous intelligence.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes