AICLOct 21, 2024

On-Device LLMs for SMEs: Challenges and Opportunities

arXiv:2410.16070v25 citationsh-index: 5
Originality Synthesis-oriented
AI Analysis

It addresses the problem of deploying LLMs on-device for SMEs, which is incremental as it reviews existing challenges and opportunities without introducing new methods.

This paper systematically reviews the infrastructure requirements for deploying Large Language Models (LLMs) on-device for small and medium-sized enterprises (SMEs), identifying challenges like limited computational resources and exploring opportunities through hardware and software innovations to enhance technological resilience.

This paper presents a systematic review of the infrastructure requirements for deploying Large Language Models (LLMs) on-device within the context of small and medium-sized enterprises (SMEs), focusing on both hardware and software perspectives. From the hardware viewpoint, we discuss the utilization of processing units like GPUs and TPUs, efficient memory and storage solutions, and strategies for effective deployment, addressing the challenges of limited computational resources typical in SME settings. From the software perspective, we explore framework compatibility, operating system optimization, and the use of specialized libraries tailored for resource-constrained environments. The review is structured to first identify the unique challenges faced by SMEs in deploying LLMs on-device, followed by an exploration of the opportunities that both hardware innovations and software adaptations offer to overcome these obstacles. Such a structured review provides practical insights, contributing significantly to the community by enhancing the technological resilience of SMEs in integrating LLMs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes