Ecomap: Sustainability-Driven Optimization of Multi-Tenant DNN Execution on Edge Servers
This addresses efficiency and environmental sustainability for edge computing systems, though it is incremental as it builds on existing optimization methods.
The paper tackles the problem of managing multiple DNN workloads on edge servers to meet latency and sustainability goals, introducing Ecomap, which reduces carbon emissions by 30% and lowers carbon delay product by 25% compared to state-of-the-art methods.
Edge computing systems struggle to efficiently manage multiple concurrent deep neural network (DNN) workloads while meeting strict latency requirements, minimizing power consumption, and maintaining environmental sustainability. This paper introduces Ecomap, a sustainability-driven framework that dynamically adjusts the maximum power threshold of edge devices based on real-time carbon intensity. Ecomap incorporates the innovative use of mixed-quality models, allowing it to dynamically replace computationally heavy DNNs with lighter alternatives when latency constraints are violated, ensuring service responsiveness with minimal accuracy loss. Additionally, it employs a transformer-based estimator to guide efficient workload mappings. Experimental results using NVIDIA Jetson AGX Xavier demonstrate that Ecomap reduces carbon emissions by an average of 30% and achieves a 25% lower carbon delay product (CDP) compared to state-of-the-art methods, while maintaining comparable or better latency and power efficiency.