Primal-Dual Deep Reinforcement Learning for Periodic Coverage-Assisted UAV Secure Communications

被引：0

作者：

Qin, Yunhui ^{[1
]}

Xing, Zhifang ^{[1
,3
]}

Li, Xulong ^{[2
]}

Zhang, Zhongshan ^{[3
]}

Zhang, Haijun ^{[2
]}

机构：

[1] Univ Sci & Technol Beijing, Natl Sch Elite Engn, Beijing 100083, Peoples R China

[2] Univ Sci & Technol Beijing, Beijing Engn & Technol Res Ctr Convergence Network, Beijing, Peoples R China

[3] Beijing Inst Technol, Sch Cyberspace Sci & Technol, Beijing 100081, Peoples R China

来源：

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2024年 / 73卷 / 12期

基金：

中国国家自然科学基金; 中国博士后科学基金;

关键词：

Autonomous aerial vehicles; Jamming; Optimization; Trajectory; Resource management; Security; Communication system security; Unmanned aerial vehicle (UAV); periodic coverage evaluation; primal-dual optimization; deep reinforcement learning; constrained Markov decision process; RESOURCE-ALLOCATION; TRAJECTORY DESIGN; SECRECY; ENERGY;

D O I：

10.1109/TVT.2024.3450956

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Considering the UAVs' energy constraints and green communication requirements, this paper proposes a periodic coverage-assisted UAV secure communication system to maximize the worst-case average achievable secrecy rate.UAV base stations serve legitimate users while UAV jammers periodically dispatch interference signals to eavesdroppers. User scheduling, UAV trajectory and power allocation are modeled as a constrained Markov decision problem with coverage evaluation constraint. Then, the joint optimization of user scheduling, UAV trajectory and power allocation is achieved by the primal-dual soft actor-critic (SAC) algorithm. Specifically, the reward critic network assesses the secrecy rate and the cost critic network fits the coverage constraint. Meanwhile, the actor network generates the user scheduling, UAV trajectory and power allocation policy while updating the dual variables. For comparison, we also adopt other deep reinforcement learning (DRL) solutions namely the SAC algorithm and the twin-delayed deep deterministic policy gradient (TD3) as well as the traditional random method and greedy method. Simulation results show that the proposed algorithm performs best in the training speed, the reward performance and the secrecy rate.

引用

页码：19641 / 19652

页数：12

共 50 条

[41] Primal-dual differential dynamic programming: A model-based reinforcement learning for constrained dynamic optimization
Kim, Jong Woo
Oh, Tae Hoon
Son, Sang Hwan
Lee, Jong Min
COMPUTERS & CHEMICAL ENGINEERING, 2022, 167
[42] Achieving Zero Constraint Violation for Concave Utility Constrained Reinforcement Learning via Primal-Dual Approach
Bai, Qinbo
Bedi, Amrit Singh
Agarwal, Mridul
Koppel, Alec
Aggarwal, Vaneet
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2023, 78 : 975 - 1016
[43] Deep Learning for Secure UAV-Assisted RIS Communication Networks
Mughal U.A.
Alkhrijah Y.
Almadhor A.
Yuen C.
IEEE Internet of Things Magazine, 2024, 7 (02): : 38 - 44
[44] Two-Time Scale Tracking Control of Flexible Robots With Primal-Dual Inverse Reinforcement Learning
Que, Xuejie
Wang, Zhenlei
Zhang, Yanqi
Su, Guanghao
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 12
[45] Hybrid UAV-Enabled Secure Offloading via Deep Reinforcement Learning
Yoo, Seonghoon
Jeong, Seongah
Kang, Joonhyuk
IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (06) : 972 - 976
[46] Deep Reinforcement Learning Aided Trajectory and Power Control for Secure UAV Communication
Wang, Zhijian
Su, Gongchao
Chen, Bin
Dai, Mingjun
Lin, Xiaohui
PROCEEDINGS OF THE 2024 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATION AND SENSOR NETWORKS, ICWCSN 2024, 2024, : 74 - 79
[47] UAV-AIDED CELLULAR COMMUNICATIONS WITH DEEP REINFORCEMENT LEARNING AGAINST JAMMING
Lu, Xiaozhen
Xiao, Liang
Dai, Canhuang
Dai, Huaiyu
IEEE WIRELESS COMMUNICATIONS, 2020, 27 (04) : 48 - 53
[48] Trajectory Design for Overlay UAV-to-Device Communications by Deep Reinforcement Learning
Wu, Fanyi
Zhang, Hongliang
Wu, Jianjun
Song, Lingyang
2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
[49] Trajectory Design for UAV Communications with No-Fly Zones by Deep Reinforcement Learning
Liu, Zhenrong
Zeng, Yuan
Zhang, Wei
Gong, Yi
2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2021,
[50] UAV-Aided Cellular Communications with Deep Reinforcement Learning against Jamming
Lu, Xiaozhen
Xiao, Liang
Dai, Canhuang
Dai, Huaiyu
IEEE Wireless Communications, 2020, 27 (04): : 48 - 53

← 1 2 3 4 5 →