LGOct 30, 2021

Throughput and Latency in the Distributed Q-Learning Random Access mMTC Networks

arXiv:2111.00299v112 citations
Originality Incremental advance
AI Analysis

This work addresses random access collisions for devices in mMTC networks, representing an incremental improvement over existing Q-learning methods.

The paper tackles the random access problem in massive machine-type communication networks by proposing a distributed packet-based Q-learning method that adjusts rewards to favor devices with more remaining packets, achieving a better throughput-latency trade-off than independent and collaborative techniques in practical scenarios.

In mMTC mode, with thousands of devices trying to access network resources sporadically, the problem of random access (RA) and collisions between devices that select the same resources becomes crucial. A promising approach to solve such an RA problem is to use learning mechanisms, especially the Q-learning algorithm, where the devices learn about the best time-slot periods to transmit through rewards sent by the central node. In this work, we propose a distributed packet-based learning method by varying the reward from the central node that favors devices having a larger number of remaining packets to transmit. Our numerical results indicated that the proposed distributed packet-based Q-learning method attains a much better throughput-latency trade-off than the alternative independent and collaborative techniques in practical scenarios of interest. In contrast, the number of payload bits of the packet-based technique is reduced regarding the collaborative Q-learning RA technique for achieving the same normalized throughput.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes