SPLGSep 23, 2020

Multi-Agent Deep Reinforcement Learning Based Trajectory Planning for Multi-UAV Assisted Mobile Edge Computing

arXiv:2009.11277v1382 citations
Originality Synthesis-oriented
AI Analysis

This work addresses trajectory planning for UAVs in mobile edge computing to improve fairness and energy efficiency, but it is incremental as it applies existing MADDPG methods to a specific domain.

The paper tackles the problem of jointly optimizing geographical fairness, load fairness, and energy consumption in a multi-UAV assisted mobile edge computing system, proposing a multi-agent deep reinforcement learning based trajectory planning algorithm that shows considerable performance improvements over traditional methods.

An unmanned aerial vehicle (UAV)-aided mobile edge computing (MEC) framework is proposed, where several UAVs having different trajectories fly over the target area and support the user equipments (UEs) on the ground. We aim to jointly optimize the geographical fairness among all the UEs, the fairness of each UAV' UE-load and the overall energy consumption of UEs. The above optimization problem includes both integer and continues variables and it is challenging to solve. To address the above problem, a multi-agent deep reinforcement learning based trajectory control algorithm is proposed for managing the trajectory of each UAV independently, where the popular Multi-Agent Deep Deterministic Policy Gradient (MADDPG) method is applied. Given the UAVs' trajectories, a low-complexity approach is introduced for optimizing the offloading decisions of UEs. We show that our proposed solution has considerable performance over other traditional algorithms, both in terms of the fairness for serving UEs, fairness of UE-load at each UAV and energy consumption for all the UEs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes