UAV Coverage Path Planning With Quantum-Based Recurrent Deep Deterministic Policy Gradient

被引:0
|
作者
Silvirianti [2 ]
Narottama, Bhaskara [2 ]
Shin, Soo Young [1 ]
机构
[1] Kumoh Natl Inst Technol, Dept IT Convergence Engn, Gumi 39177, Gyeongsangbugdo, South Korea
[2] Univ Quebec, Inst Natl Rech Sci INRS, Montreal, PQ H5A 1K6, Canada
关键词
Autonomous aerial vehicles; Training; Optimization; NOMA; Encoding; Vehicle dynamics; Resource management; Deep deterministic policy gradient; energy efficiency; quantum embedding; recurrent; UAV communications; NETWORKS;
D O I
10.1109/TVT.2023.3347219
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This study proposes quantum-based deep deterministic policy gradient (Q-DDPG) and quantum-based recurrent DDPG (Q-RDDPG) schemes for time-series optimization in UAV communications. Herein, Q-DDPG-based actor-critic reinforcement learning is utilized to optimize action selections in a large state and continuous action space. In this scheme, quantum models are exploited to reduce computational complexity and training loss. As a particular case, Q-DDPG and Q-RDDPG are employed for trajectory optimization and dynamic resource allocation in UAV communications. The results demonstrate that Q-DDPG and Q-RDDPG schemes achieved higher rewards with lower training losses compared to classical DDPG.
引用
收藏
页码:7424 / 7429
页数:6
相关论文
共 50 条
  • [1] UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient
    Wu, Runjia
    Gu, Fangqing
    Liu, Hai-lin
    Shi, Hongjian
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [2] A multi-critic deep deterministic policy gradient UAV path planning
    Wu, Runjia
    Gu, Fangqing
    Huang, Jie
    [J]. 2020 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2020), 2020, : 6 - 10
  • [3] Path planning based on improved Deep Deterministic Policy Gradient algorithm
    Liu, Yandong
    Zhang, Wenzhi
    Chen, Fumin
    Li, Jianliang
    [J]. PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 295 - 299
  • [4] Path Planning of Humanoid Arm Based on Deep Deterministic Policy Gradient
    Wen, Shuhuan
    Chen, Jianhua
    Wang, Shen
    Zhang, Hong
    Hu, Xueheng
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 1755 - 1760
  • [5] Efficient Path Planning for Mobile Robot Based on Deep Deterministic Policy Gradient
    Gong, Hui
    Wang, Peng
    Ni, Cui
    Cheng, Nuo
    [J]. SENSORS, 2022, 22 (09)
  • [6] Compensation Control of UAV Based on Deep Deterministic Policy Gradient
    Xu, Zijun
    Qi, Juntong
    Wang, Mingming
    Wu, Chong
    Yang, Guang
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 2289 - 2296
  • [7] Mapless Path Planning for Mobile Robot Based on Improved Deep Deterministic Policy Gradient Algorithm
    Zhang, Shuzhen
    Tang, Wei
    Li, Panpan
    Zha, Fusheng
    [J]. SENSORS, 2024, 24 (17)
  • [8] Local Path Planning with Turnabouts for Mobile Robot by Deep Deterministic Policy Gradient
    Nakamura, Tomoaki
    Kobayashi, Masato
    Motoi, Naoki
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS, ICM, 2023,
  • [9] Deep deterministic policy gradient algorithm for crowd-evacuation path planning
    Li, Xinjin
    Liu, Hong
    Li, Junqing
    Li, Yan
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 161
  • [10] Deep deterministic policy gradient algorithm for UAV control
    Huang, Xu
    Liu, Jiarun
    Jia, Chenhui
    Wang, Zhaolei
    Zhang, Jun
    [J]. Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2021, 42 (11):