Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/25934
Title: Multi-UAV Trajectory Design and Power Control Based on Deep Reinforcement Learning
Authors: Zhang, C
Liang, S
He, C
Wang, K
Keywords: deep reinforcement learning;mobile edge computing;unmanned aerial vehicle (UAV);trajectory control;user association
Issue Date: 16-Feb-2021
Publisher: China InfoCom Media Group
Citation: Zhang, C. et al. (2022) 'Multi-UAV Trajectory Design and Power Control Based on Deep Reinforcement Learning', Journal of Communications and Information Networks, 2022, 7 (2), pp. 192 - 201. doi: 10.23919/JCIN.2022.9815202
Abstract: In this paper,multi-unmanned aerial vehicle (multi-UAV) and multi-user system are studied, where UAVs are served as aerial base stations (BS) for ground users in the same frequency band without knowing the locations and channel parameters for the users. We aim to maximize the total throughput for all the users and meet the fairness requirement by optimizing the UAVs’ trajectories and transmission power in a centralized way. This problem is non-convex and very difficult to solve,as the locations of the user are unknown to the UAVs. We propose a deep reinforcement learning(DRL)-based solution,i.e.,soft actor-critic(SAC)to address it via modeling the problem as a Markov decision process (MDP). We carefully design the reward function that combines sparse with non-sparse reward to achieve the balance between exploitation and exploration.The simulation results show that the proposed SAC has a very good performance in terms of both training and testing.
URI: https://bura.brunel.ac.uk/handle/2438/25934
DOI: https://doi.org/10.23919/JCIN.2022.9815202
ISSN: 2096-1081
Other Identifiers: ORCID iD: Kezhi Wang https://orcid.org/0000-0001-8602-0800
Appears in Collections:Dept of Computer Science Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdf1.15 MBAdobe PDFView/Open


Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.