IVAILGSep 25, 2020

Pareto-Optimal Bit Allocation for Collaborative Intelligence

arXiv:2009.12430v236 citations
AI Analysis

This work addresses the optimization of resource allocation in edge-cloud AI deployments, which is an incremental improvement for efficient mobile/edge computing.

The paper tackles the problem of bit allocation for feature coding in multi-stream collaborative intelligence systems, providing closed-form solutions for single-task and scalarized multi-task systems, and analytical characterization of Pareto sets for specific configurations, with results demonstrated on various DNN models.

In recent studies, collaborative intelligence (CI) has emerged as a promising framework for deployment of Artificial Intelligence (AI)-based services on mobile/edge devices. In CI, the AI model (a deep neural network) is split between the edge and the cloud, and intermediate features are sent from the edge sub-model to the cloud sub-model. In this paper, we study bit allocation for feature coding in multi-stream CI systems. We model task distortion as a function of rate using convex surfaces similar to those found in distortion-rate theory. Using such models, we are able to provide closed-form bit allocation solutions for single-task systems and scalarized multi-task systems. Moreover, we provide analytical characterization of the full Pareto set for 2-stream k-task systems, and bounds on the Pareto set for 3-stream 2-task systems. Analytical results are examined on a variety of DNN models from the literature to demonstrate wide applicability of the results

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes