Layerwise Quantum Deep Reinforcement Learning for Joint Optimization of UAV Trajectory and Resource Allocation

被引:0
|
作者
Silvirianti [2 ]
Narottama, Bhaskara [2 ]
Shin, Soo Young [1 ]
机构
[1] Kumoh Natl Inst Technol, Dept IT Convergence Engn, WENS Lab, Gumi 39177, Gyeongsangbuk, South Korea
[2] Univ Quebec, Inst Natl Rech Sci INRS, Montreal, PQ H5A 1K6, Canada
基金
新加坡国家研究基金会;
关键词
Quantum computing; Training; Quantum state; Optimization; Autonomous aerial vehicles; Trajectory; Resource management; Deep reinforcement learning; joint optimization; layerwise training; local loss; quantum embedding; unmanned aerial vehicle (UAV); NETWORKS; POWER;
D O I
10.1109/JIOT.2023.3285968
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study proposes a layerwise quantum-based deep reinforcement learning (LQ-DRL) method for optimizing continuous large space and time series problems using deep-layer training. The actions in LQ-DRL are optimized using a layerwise quantum embedding that leverages the advantages of quantum computing to maximize reward and reduce training loss. Moreover, this study employs a local loss to minimize the occurrence of barren plateaus phenomena and further enhance performance. As a particular case, the proposed scheme is employed to jointly optimize: 1) unmanned aerial vehicle (UAV) trajectory planning; 2) user grouping; and 3) power allocation for higher energy efficiency of a UAV as the reward. The combination of these optimized factors is referred to as action space in the presented LQ-DRL. The LQ-DRL is employed to solve the optimization problem due to its nonconvexity, continuous and large action space, and time-series domain. In a practical view, LQ-DRL aims to solve the issue of energy consumption related to limited-battery energy of a UAV base station (BS) while maintaining Quality of Service (QoS) for users, by gaining maximum energy efficiency as the reward. One of real applications, as an example, LQ-DRL can be employed to maximize the energy efficiency of a UAV BS in UAV empowered disaster recovery networks scenario. The quantum circuits of layerwise quantum embedding are presented to show the practical implementation in noisy intermediate-scale quantum computers. Based on the results, LQ-DRL outperformed the classical DRL by achieving higher effective dimension, rewards, and lower learning losses. In addition, better performances were achieved using more layers.
引用
收藏
页码:430 / 443
页数:14
相关论文
共 50 条
  • [1] Joint Optimization of UAV Trajectory and Resource Allocation for Federal Learning
    Yao, Xiancai
    Zheng, Jianchao
    Zheng, Xin
    Yang, Xiaolong
    [J]. Computer Engineering and Applications, 2024, 60 (11) : 336 - 345
  • [2] Deep Reinforcement Learning Assisted UAV Trajectory and Resource Optimization for NOMA Networks
    Chen, Peixin
    Zhao, Jian
    Shen, Furao
    [J]. 2022 14TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING, WCSP, 2022, : 933 - 938
  • [3] Deep Reinforcement Learning-Empowered Trajectory and Resource Allocation Optimization for UAV-Assisted MEC Systems
    Sun, Haowen
    Chen, Ming
    Pan, Yijin
    Cang, Yihan
    Zhao, Jiahui
    Sun, Yuanzhi
    [J]. IEEE WIRELESS COMMUNICATIONS LETTERS, 2024, 13 (07) : 1823 - 1827
  • [4] Joint UAV Deployment and Resource Allocation: A Personalized Federated Deep Reinforcement Learning Approach
    Xu, Xinyi
    Feng, Gang
    Qin, Shuang
    Liu, Yijing
    Sun, Yao
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (03) : 4005 - 4018
  • [5] Deep Reinforcement Learning Based Trajectory Design and Resource Allocation for UAV-Assisted Communications
    Zhang, Chiya
    Li, Zhukun
    He, Chunlong
    Wang, Kezhi
    Pan, Cunhua
    [J]. IEEE COMMUNICATIONS LETTERS, 2023, 27 (09) : 2398 - 2402
  • [6] Trajectory Design and Resource Allocation for Multi-UAV Networks: Deep Reinforcement Learning Approaches
    Chang, Zheng
    Deng, Hengwei
    You, Li
    Min, Geyong
    Garg, Sahil
    Kaddoum, Georges
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (05): : 2940 - 2951
  • [7] Deep Reinforcement Learning for Jointly Resource Allocation and Trajectory Planning in UAV-Assisted Networks
    Jwaifel, Arwa Mahmoud
    Van Do, Tien
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2023, 2023, 14162 : 71 - 83
  • [8] Joint Resource Allocation and Trajectory Optimization with QoS in NOMA UAV Networks
    Li, Yabo
    Zhang, Haijun
    Long, Keping
    Jiang, Chunxiao
    Guizani, Mohsen
    [J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [9] Deep Reinforcement Learning for Trajectory Design and Power Allocation in UAV Networks
    Zhao, Nan
    Cheng, Yiqiang
    Pei, Yiyang
    Liang, Ying-Chang
    Niyato, Dusit
    [J]. ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [10] Deep Reinforcement Learning Based Resource Allocation and Trajectory Planning in Integrated Sensing and Communications UAV Network
    Qin, Yunhui
    Zhang, Zhongshan
    Li, Xulong
    Wei Huangfu
    Zhang, Haijun
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (11) : 8158 - 8169