Reinforcement Learning in Multiple-UAV Networks: Deployment and Movement Design

被引:187
|
作者
Liu, Xiao [1 ]
Liu, Yuanwei [1 ]
Chen, Yue [1 ]
机构
[1] Queen Mary Univ London, London E1 4NS, England
关键词
Dynamic movement; Q-learning; quality of experience (QoE); three-dimensional deployment; unmanned aerial vehicle (UAV); UNMANNED AERIAL VEHICLES; RESOURCE-ALLOCATION; WIRELESS COMMUNICATIONS; COMMUNICATION; OPTIMIZATION; ALGORITHM; PLACEMENT; MOBILE; 5G;
D O I
10.1109/TVT.2019.2922849
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A novel framework is proposed for quality of experience driven deployment and dynamic movement of multiple unmanned aerial vehicles (UAVs). The problem of joint non-convex three-dimensional (3-D) deployment and dynamic movement of the UAVs is formulated for maximizing the sum mean opinion score of ground users, which is proved to be NP-hard. In the aim of solving this pertinent problem, a three-step approach is proposed for attaining 3-D deployment and dynamic movement of multiple UAVs. First, a genetic algorithm based K-means (GAK-means) algorithm is utilized for obtaining the cell partition of the users. Second, Q-learning based deployment algorithm is proposed, in which each UAV acts as an agent, making their own decision for attaining 3-D position by learning from trial and mistake. In contrast to the conventional genetic algorithm based learning algorithms, the proposed algorithm is capable of training the direction selection strategy offline. Third, Q-learning based movement algorithm is proposed in the scenario that the users are roaming. The proposed algorithm is capable of converging to an optimal state. Numerical results reveal that the proposed algorithms show a fast convergence rate after a small number of iterations. Additionally, the proposed Q-learning based deployment algorithm outperforms K-means algorithms and Iterative-GAKmean algorithms with low complexity.
引用
收藏
页码:8036 / 8049
页数:14
相关论文
共 50 条
  • [1] Multiple-UAV Reinforcement Learning Algorithm Based on Improved PPO in Ray Framework
    Zhan, Guang
    Zhang, Xinmiao
    Li, Zhongchao
    Xu, Lin
    Zhou, Deyun
    Yang, Zhen
    [J]. DRONES, 2022, 6 (07)
  • [2] Teaching and Learning Virtual Strategy for the Navigation of Multiple-UAV
    Bonilla, Edison L.
    Rodriguez, Jacson J.
    Acosta, Julio F.
    Andaluz, Victor H.
    [J]. 2020 15TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI'2020), 2020,
  • [3] Multiple-UAV Coordination and Communications in Tactical Edge Networks
    Tortonesi, Mauro
    Stefanelli, Cesare
    Benvegnu, Erika
    Ford, Ken
    Suri, Niranjan
    Linderman, Mark
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2012, 50 (10) : 48 - 55
  • [4] Deployment and Movement for Multiple Aerial Base Stations by Reinforcement Learning
    Liu, Xiao
    Liu, Yuanwei
    Chen, Yue
    [J]. 2018 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2018,
  • [5] Optimum Aerial Base Station Deployment for UAV Networks: A Reinforcement Learning Approach
    Hou, Meng-Chun
    Deng, Der-Jiunn
    Wu, Chia-Ling
    [J]. 2019 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2019,
  • [6] Human-Autonomy Teaming Interface Design for Multiple-UAV Control
    Fedulin A.M.
    Evstaf’ev D.V.
    Kondrashova G.L.
    Artemenko N.V.
    [J]. Russian Aeronautics, 2022, 65 (02): : 419 - 424
  • [7] Adaptive Deployment of UAV-Aided Networks Based on Hybrid Deep Reinforcement Learning
    Ma, Xiaoyong
    Hu, Shuting
    Zhou, Danyang
    Zhou, Yi
    Lu, Ning
    [J]. 2020 IEEE 92ND VEHICULAR TECHNOLOGY CONFERENCE (VTC2020-FALL), 2020,
  • [8] A multiple-UAV architecture for autonomous media production
    Mademlis, Ioannis
    Torres-Gonzalez, Arturo
    Capitan, Jesus
    Montagnuolo, Maurizio
    Messina, Alberto
    Negro, Fulvio
    Le Barz, Cedric
    Goncalves, Tiago
    Cunha, Rita
    Guerreiro, Bruno
    Zhang, Fan
    Boyle, Stephen
    Guerout, Gregoire
    Tefas, Anastasios
    Nikolaidis, Nikos
    Bull, David
    Pitas, Ioannis
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (02) : 1905 - 1934
  • [9] RIS-assisted UAV Networks: Deployment Optimization with Reinforcement-Learning-Based Federated Learning
    Wang, Hsuan-Fu
    Huang, Cheng-Sen
    Wang, Li-Chun
    [J]. 2021 30TH WIRELESS AND OPTICAL COMMUNICATIONS CONFERENCE (WOCC 2021), 2021, : 257 - 262
  • [10] Cascade-type guidance law design for multiple-UAV formation keeping
    No, Tae Soo
    Kim, Youdan
    Tahk, Min-Jea
    Jeon, Gyeong-Eon
    [J]. AEROSPACE SCIENCE AND TECHNOLOGY, 2011, 15 (06) : 431 - 439