LG ARJan 22, 2025

Efficient Implementation of LinearUCB through Algorithmic Improvements and Vector Computing Acceleration for Embedded Learning Systems

Marco Angioli, Marcello Barbirotta, Abdallah Cheikh, Antonio Mastrandrea, Francesco Menichelli, Mauro Olivieri

arXiv:2501.13139v113.06 citationsh-index: 12ACM Trans Embed Comput Syst

Originality Synthesis-oriented

AI Analysis

This work addresses the problem of enabling real-time AI decision-making on low-power edge devices for IoT applications, but it is incremental as it optimizes existing methods.

The paper tackled the challenge of implementing LinearUCB contextual bandit algorithms on resource-constrained embedded devices by combining algorithmic modifications and vector acceleration, resulting in notable improvements in execution time and energy consumption.

As the Internet of Things expands, embedding Artificial Intelligence algorithms in resource-constrained devices has become increasingly important to enable real-time, autonomous decision-making without relying on centralized cloud servers. However, implementing and executing complex algorithms in embedded devices poses significant challenges due to limited computational power, memory, and energy resources. This paper presents algorithmic and hardware techniques to efficiently implement two LinearUCB Contextual Bandits algorithms on resource-constrained embedded devices. Algorithmic modifications based on the Sherman-Morrison-Woodbury formula streamline model complexity, while vector acceleration is harnessed to speed up matrix operations. We analyze the impact of each optimization individually and then combine them in a two-pronged strategy. The results show notable improvements in execution time and energy consumption, demonstrating the effectiveness of combining algorithmic and hardware optimizations to enhance learning models for edge computing environments with low-power and real-time requirements.

View on arXiv PDF

Similar