LG, AI · Apr 7, 2025

Optimizing Large Language Models: Metrics, Energy Efficiency, and Case Study Insights

Tahniat Khan, Soroor Motie, Sedef Akinli Kocak, Shaina Raza

arXiv:2504.06307v1

Originality: Synthesis-oriented

AI Analysis

The paper addresses sustainability challenges in deploying AI in resource-constrained environments, offering incremental improvements that reduce environmental impact.

This paper tackles the high energy consumption and carbon emissions of large language models (LLMs) by integrating energy-efficient optimization techniques, achieving up to 45% reductions in energy use and emissions through quantization and local inference without compromising performance.

The rapid adoption of large language models (LLMs) has led to significant energy consumption and carbon emissions, posing a critical challenge to the sustainability of generative AI technologies. This paper explores the integration of energy-efficient optimization techniques in the deployment of LLMs to address these environmental concerns. We present a case study and framework that demonstrate how strategic quantization and local inference techniques can substantially lower the carbon footprint of LLMs without compromising their operational effectiveness. Experimental results reveal that these methods can reduce energy consumption and carbon emissions by up to 45% after quantization, making them particularly suitable for resource-constrained environments. The findings provide actionable insights for achieving sustainability in AI while maintaining high levels of accuracy and responsiveness.
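To make the two ideas in the abstract concrete, the following is a minimal sketch of what quantization and a percentage reduction look like in code. It is not the authors' pipeline: the abstract does not specify a quantization scheme, so symmetric per-tensor int8 quantization is assumed here as a common baseline, and the function names (`quantize_int8`, `dequantize`, `percent_reduction`) are hypothetical.

```python
def quantize_int8(weights):
    """Symmetric per-tensor int8 quantization (an assumed scheme, not the paper's).

    Maps floats to integer codes in [-127, 127] using a single scale factor.
    Returns (codes, scale).
    """
    max_abs = max(abs(w) for w in weights)
    # Guard against an all-zero tensor, where any scale would do.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate float weights from int8 codes."""
    return [c * scale for c in codes]


def percent_reduction(baseline, optimized):
    """Relative saving of `optimized` versus `baseline`, in percent."""
    return 100.0 * (baseline - optimized) / baseline
```

Each weight is stored in 8 bits instead of 32, at the cost of a rounding error of at most half a scale step per weight. `percent_reduction` is the arithmetic behind a figure such as "up to 45%": for instance, a measured drop from 100 to 55 units of energy gives 45%. Those two numbers are illustrative only and are not taken from the paper.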

