DCAICLLGNov 7, 2025

Characterizing and Understanding Energy Footprint and Efficiency of Small Language Model on Edges

arXiv:2511.11624v1h-index: 6MASS
Originality Synthesis-oriented
AI Analysis

It provides practical insights for deploying AI in energy-constrained environments like smart systems and mobile platforms, but is incremental as it benchmarks existing models.

This study evaluated the power efficiency of five small language models on edge devices, finding that Jetson Orin Nano with GPU acceleration achieved the highest energy-to-performance ratio and Llama 3.2 offered the best balance of accuracy and power efficiency.

Cloud-based large language models (LLMs) and their variants have significantly influenced real-world applications. Deploying smaller models (i.e., small language models (SLMs)) on edge devices offers additional advantages, such as reduced latency and independence from network connectivity. However, edge devices' limited computing resources and constrained energy budgets challenge efficient deployment. This study evaluates the power efficiency of five representative SLMs - Llama 3.2, Phi-3 Mini, TinyLlama, and Gemma 2 on Raspberry Pi 5, Jetson Nano, and Jetson Orin Nano (CPU and GPU configurations). Results show that Jetson Orin Nano with GPU acceleration achieves the highest energy-to-performance ratio, significantly outperforming CPU-based setups. Llama 3.2 provides the best balance of accuracy and power efficiency, while TinyLlama is well-suited for low-power environments at the cost of reduced accuracy. In contrast, Phi-3 Mini consumes the most energy despite its high accuracy. In addition, GPU acceleration, memory bandwidth, and model architecture are key in optimizing inference energy efficiency. Our empirical analysis offers practical insights for AI, smart systems, and mobile ad-hoc platforms to leverage tradeoffs from accuracy, inference latency, and power efficiency in energy-constrained environments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes