Primal-Dual Deep Reinforcement Learning for Periodic Coverage-Assisted UAV Secure Communications

被引：0

作者：

Qin, Yunhui ^{[1
]}

Xing, Zhifang ^{[1
,3
]}

Li, Xulong ^{[2
]}

Zhang, Zhongshan ^{[3
]}

Zhang, Haijun ^{[2
]}

机构：

[1] Univ Sci & Technol Beijing, Natl Sch Elite Engn, Beijing 100083, Peoples R China

[2] Univ Sci & Technol Beijing, Beijing Engn & Technol Res Ctr Convergence Network, Beijing, Peoples R China

[3] Beijing Inst Technol, Sch Cyberspace Sci & Technol, Beijing 100081, Peoples R China

来源：

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2024年 / 73卷 / 12期

基金：

中国国家自然科学基金; 中国博士后科学基金;

关键词：

Autonomous aerial vehicles; Jamming; Optimization; Trajectory; Resource management; Security; Communication system security; Unmanned aerial vehicle (UAV); periodic coverage evaluation; primal-dual optimization; deep reinforcement learning; constrained Markov decision process; RESOURCE-ALLOCATION; TRAJECTORY DESIGN; SECRECY; ENERGY;

D O I：

10.1109/TVT.2024.3450956

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Considering the UAVs' energy constraints and green communication requirements, this paper proposes a periodic coverage-assisted UAV secure communication system to maximize the worst-case average achievable secrecy rate.UAV base stations serve legitimate users while UAV jammers periodically dispatch interference signals to eavesdroppers. User scheduling, UAV trajectory and power allocation are modeled as a constrained Markov decision problem with coverage evaluation constraint. Then, the joint optimization of user scheduling, UAV trajectory and power allocation is achieved by the primal-dual soft actor-critic (SAC) algorithm. Specifically, the reward critic network assesses the secrecy rate and the cost critic network fits the coverage constraint. Meanwhile, the actor network generates the user scheduling, UAV trajectory and power allocation policy while updating the dual variables. For comparison, we also adopt other deep reinforcement learning (DRL) solutions namely the SAC algorithm and the twin-delayed deep deterministic policy gradient (TD3) as well as the traditional random method and greedy method. Simulation results show that the proposed algorithm performs best in the training speed, the reward performance and the secrecy rate.

引用

页码：19641 / 19652

页数：12

共 50 条

[31] RIS-Assisted UAV-D2D Communications Exploiting Deep Reinforcement Learning
YOU Qian
XU Qian
YANG Xin
ZHANG Tao
CHEN Ming
ZTE Communications, 2023, 21 (02) : 61 - 69
[32] Provably Efficient Primal-Dual Reinforcement Learning for CMDPs with Non-stationary Objectives and Constraints
Ding, Yuhao
Lavaei, Javad
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 7396 - 7404
[33] Hybrid IRS-Assisted Secure Satellite Downlink Communications: A Fast Deep Reinforcement Learning Approach
Ngo, Quynh Tu
Phan, Khoa Tran
Mahmood, Abdun
Xiang, Wei
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (04): : 2858 - 2869
[34] A Dual Deep Network Based Secure Deep Reinforcement Learning Method
Zhu F.
Wu W.
Fu Y.-C.
Liu Q.
Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (08): : 1812 - 1826
[35] Deep Reinforcement Learning for UAV-Assisted Emergency Response
Lee, Isabella
Babu, Vignesh
Caesar, Matthew
Nicol, David
PROCEEDINGS OF THE 17TH EAI INTERNATIONAL CONFERENCE ON MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES (MOBIQUITOUS 2020), 2021, : 327 - 336
[36] Energy Harvesting UAV-RIS-Assisted Maritime Communications Based on Deep Reinforcement Learning Against Jamming
Yang, Helin
Lin, Kailong
Xiao, Liang
Zhao, Yifeng
Xiong, Zehui
Han, Zhu
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (08) : 9854 - 9868
[37] Learning for Path Planning and Coverage Mapping in UAV-Assisted Emergency Communications
Steiger, Juaren
Lu, Ning
Sorour, Sameh
2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
[38] Trajectory Design for UAV-Enabled Maritime Secure Communications: A Reinforcement Learning Approach
Liu, Jintao
Zeng, Feng
Wang, Wei
Sheng, Zhichao
Wei, Xinchen
Cumanan, Kanapathippillai
CHINA COMMUNICATIONS, 2022, 19 (09) : 26 - 36
[39] Trajectory Design for UAV-Enabled Maritime Secure Communications: A Reinforcement Learning Approach
Jintao Liu
Feng Zeng
Wei Wang
Zhichao Sheng
Xinchen Wei
Kanapathippillai Cumanan
China Communications, 2022, 19 (09) : 26 - 36
[40] Reinforcement Learning Based Dual-UAV Trajectory Optimization for Secure Communication
Qian, Zhouyi
Deng, Zhixiang
Cai, Changchun
Li, Haochen
ELECTRONICS, 2023, 12 (09)

← 1 2 3 4 5 →