SPLGFeb 8, 2021

Learning Task-Oriented Communication for Edge Inference: An Information Bottleneck Approach

arXiv:2102.04170v3343 citations
AI Analysis

This work provides an incremental improvement for edge inference systems by reducing communication overhead and latency for devices with limited bandwidth.

This paper addresses the challenge of efficient communication for edge inference by jointly optimizing feature extraction, source coding, and channel coding. It proposes a learning-based scheme using a variational information bottleneck (VIB) framework with a sparsity-inducing prior and dynamic neural networks, resulting in a better rate-distortion tradeoff and reduced feature transmission latency compared to baseline methods.

This paper investigates task-oriented communication for edge inference, where a low-end edge device transmits the extracted feature vector of a local data sample to a powerful edge server for processing. It is critical to encode the data into an informative and compact representation for low-latency inference given the limited bandwidth. We propose a learning-based communication scheme that jointly optimizes feature extraction, source coding, and channel coding in a task-oriented manner, i.e., targeting the downstream inference task rather than data reconstruction. Specifically, we leverage an information bottleneck (IB) framework to formalize a rate-distortion tradeoff between the informativeness of the encoded feature and the inference performance. As the IB optimization is computationally prohibitive for the high-dimensional data, we adopt a variational approximation, namely the variational information bottleneck (VIB), to build a tractable upper bound. To reduce the communication overhead, we leverage a sparsity-inducing distribution as the variational prior for the VIB framework to sparsify the encoded feature vector. Furthermore, considering dynamic channel conditions in practical communication systems, we propose a variable-length feature encoding scheme based on dynamic neural networks to adaptively adjust the activated dimensions of the encoded feature to different channel conditions. Extensive experiments evidence that the proposed task-oriented communication system achieves a better rate-distortion tradeoff than baseline methods and significantly reduces the feature transmission latency in dynamic channel conditions.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes