NI DC LGDec 16, 2020

Online Service Migration in Mobile Edge with Incomplete System Information: A Deep Recurrent Actor-Critic Learning Approach

Jin Wang, Jia Hu, Geyong Min, Qiang Ni, Tarek El-Ghazawi

arXiv:2012.08679v52.354 citationsHas Code

Originality Incremental advance

AI Analysis

This work is significant for mobile users and MEC providers, as it offers a scalable and efficient method for maintaining Quality-of-Service during user roaming by making online migration decisions with incomplete system information, representing an incremental improvement over existing centralized and complete-information approaches.

This paper addresses the challenge of online service migration in Mobile Edge Computing (MEC) environments where system information is incomplete. The proposed user-centric deep recurrent actor-critic learning approach, which models the problem as a Partially Observable Markov Decision Process (POMDP), consistently outperforms existing heuristic and learning-driven algorithms, achieving near-optimal results across various MEC scenarios.

Multi-access Edge Computing (MEC) is an emerging computing paradigm that extends cloud computing to the network edge to support resource-intensive applications on mobile devices. As a crucial problem in MEC, service migration needs to decide how to migrate user services for maintaining the Quality-of-Service when users roam between MEC servers with limited coverage and capacity. However, finding an optimal migration policy is intractable due to the dynamic MEC environment and user mobility. Many existing studies make centralized migration decisions based on complete system-level information, which is time-consuming and also lacks desirable scalability. To address these challenges, we propose a novel learning-driven method, which is user-centric and can make effective online migration decisions by utilizing incomplete system-level information. Specifically, the service migration problem is modeled as a Partially Observable Markov Decision Process (POMDP). To solve the POMDP, we design a new encoder network that combines a Long Short-Term Memory (LSTM) and an embedding matrix for effective extraction of hidden information, and further propose a tailored off-policy actor-critic algorithm for efficient training. The extensive experimental results based on real-world mobility traces demonstrate that this new method consistently outperforms both the heuristic and state-of-the-art learning-driven algorithms and can achieve near-optimal results on various MEC scenarios.

View on arXiv PDF Code

Similar