SPITLGMLMay 5, 2023

Model-free Reinforcement Learning of Semantic Communication by Stochastic Policy Gradient

arXiv:2305.03571v28 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of deploying semantic communication in practical wireless systems by enabling model-free learning, though it is incremental as it builds on existing reinforcement learning methods.

The authors tackled the problem of designing a semantic communication system without requiring a known or differentiable channel model by applying Stochastic Policy Gradient (SPG) reinforcement learning, achieving comparable performance to a model-aware approach but with decreased convergence rate.

Following the recent success of Machine Learning tools in wireless communications, the idea of semantic communication by Weaver from 1949 has gained attention. It breaks with Shannon's classic design paradigm by aiming to transmit the meaning, i.e., semantics, of a message instead of its exact version, allowing for information rate savings. In this work, we apply the Stochastic Policy Gradient (SPG) to design a semantic communication system by reinforcement learning, separating transmitter and receiver, and not requiring a known or differentiable channel model -- a crucial step towards deployment in practice. Further, we derive the use of SPG for both classic and semantic communication from the maximization of the mutual information between received and target variables. Numerical results show that our approach achieves comparable performance to a model-aware approach based on the reparametrization trick, albeit with a decreased convergence rate.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes