LGMLJun 20, 2018

Rethinking Machine Learning Development and Deployment for Edge Devices

arXiv:1806.07846v13 citations
Originality Synthesis-oriented
AI Analysis

This addresses deployment issues for edge computing applications, but it appears incremental as it builds on existing ML frameworks.

The paper tackles the challenge of deploying machine learning models on edge devices by proposing a new development and deployment approach optimized for inference-only scenarios, resulting in more efficient and high-quality solutions.

Machine learning (ML), especially deep learning is made possible by the availability of big data, enormous compute power and, often overlooked, development tools or frameworks. As the algorithms become mature and efficient, more and more ML inference is moving out of datacenters/cloud and deployed on edge devices. This model deployment process can be challenging as the deployment environment and requirements can be substantially different from those during model development. In this paper, we propose a new ML development and deployment approach that is specially designed and optimized for inference-only deployment on edge devices. We build a prototype and demonstrate that this approach can address all the deployment challenges and result in more efficient and high-quality solutions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes