UAV Networks Against Multiple Maneuvering Smart Jamming With Knowledge-Based Reinforcement Learning

Cited by: 22

Authors:

Li, Zhiwei [1]

Lu, Yu [2]

Li, Xi [2]

Wang, Zengguang [3]

Qiao, Wenxin [2]

Liu, Yicen [2]

Affiliations:

[1] Army Engn Univ, UAV Engn Dept, Shijiazhuang Campus, Shijiazhuang 050003, Hebei, Peoples R China

[2] Aircraft Maintenance Ctr, Shijiazhuang Campus, Yongji 044500, Peoples R China

[3] Natl Def Univ, Shijiazhuang 050000, Hebei, Peoples R China

Source:

IEEE INTERNET OF THINGS JOURNAL | 2021 / Vol. 8 / Issue 15

Keywords:

Jamming; Interference; Reinforcement learning; Games; Receivers; Signal to noise ratio; Convergence; Anti-jamming; domain knowledge; reinforcement learning (RL); unmanned aerial vehicle (UAV) networks; STACKELBERG GAME; TRANSMISSION;

DOI:

10.1109/JIOT.2021.3062659

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Unmanned aerial vehicle (UAV) networks are highly vulnerable to smart jammers, which can choose their jamming strategy according to the ongoing channel state. Although reinforcement learning (RL) algorithms can give UAV networks the ability to make intelligent decisions, the high-dimensional state space makes it difficult for these algorithms to converge quickly. This article proposes a knowledge-based RL method that uses domain knowledge to compress the state space the agent needs to explore, thereby improving the convergence speed of the algorithm. Specifically, we use the law of inertia of the aircraft and the law of signal attenuation in free space to guide efficient exploration of the state space by the UAVs. We incorporate the performance indicators of the receiver and the subjective value of the task into the design of the reward function, and we build a virtual environment for pretraining to accelerate the convergence of anti-jamming decisions. In addition, the proposed algorithm is based entirely on observable data, which is more realistic than studies that assume knowledge of the jammer's position or channel strategy. Simulations show that the proposed algorithm outperforms the model-free RL benchmarks in terms of convergence speed and average reward.
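The abstract refers to the law of signal attenuation in free space and to receiver performance indicators such as the signal-to-noise ratio. As a rough illustration only, and not the authors' implementation, the following sketch computes the standard free-space path loss and a resulting signal-to-interference-plus-noise ratio (SINR). All function names, the isotropic-antenna simplification, and the example powers and frequency are assumptions introduced here for illustration.

```python
import math

SPEED_OF_LIGHT = 299_792_458.0  # metres per second


def free_space_path_loss_db(distance_m: float, freq_hz: float) -> float:
    """Free-space path loss in dB between isotropic antennas (Friis model)."""
    if distance_m <= 0 or freq_hz <= 0:
        raise ValueError("distance and frequency must be positive")
    return 20.0 * math.log10(4.0 * math.pi * distance_m * freq_hz / SPEED_OF_LIGHT)


def received_power_dbm(tx_power_dbm: float, distance_m: float, freq_hz: float) -> float:
    """Received power assuming unit antenna gains (an assumption of this sketch)."""
    return tx_power_dbm - free_space_path_loss_db(distance_m, freq_hz)


def dbm_to_mw(power_dbm: float) -> float:
    """Convert a power level from dBm to milliwatts."""
    return 10.0 ** (power_dbm / 10.0)


def sinr_db(signal_dbm: float, jammer_dbm: list, noise_dbm: float) -> float:
    """SINR in dB: signal power over summed jammer power plus noise."""
    interference_mw = sum(dbm_to_mw(p) for p in jammer_dbm)
    return 10.0 * math.log10(dbm_to_mw(signal_dbm) / (interference_mw + dbm_to_mw(noise_dbm)))


if __name__ == "__main__":
    # Hypothetical scenario: a 20 dBm transmitter 500 m away and two
    # 30 dBm jammers at 1 km and 2 km, all at 2.4 GHz, with a -90 dBm noise floor.
    freq = 2.4e9
    signal = received_power_dbm(20.0, 500.0, freq)
    jammers = [received_power_dbm(30.0, d, freq) for d in (1000.0, 2000.0)]
    print(f"signal: {signal:.2f} dBm, SINR: {sinr_db(signal, jammers, -90.0):.2f} dB")
```

Because the loss grows by about 6 dB for every doubling of distance, an agent that knows this law can rule out positions where the jammers' received power must dominate without visiting them, which is one plausible reading of how domain knowledge could compress the state space to be explored.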

Pages: 12289 - 12310

Number of pages: 22

50 records in total

[1] UAV Relay in VANETs Against Smart Jamming With Reinforcement Learning
Xiao, Liang
Lu, Xiaozhen
Xu, Dongjin
Tang, Yuliang
Wang, Lei
Zhuang, Weihua
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (05) : 4087 - 4097
[2] Reinforcement Learning Based UAV Swarm Communications Against Jamming
Lv, Zefang
Niu, Guohang
Xiao, Liang
Xing, Chengwen
Xu, Wenyuan
IEEE International Conference on Communications, 2023, 2023-May : 5204 - 5209
[3] Reinforcement Learning based UAV Swarm Communications Against Jamming
Lv, Zefang
Niu, Guohang
Xiao, Liang
Xing, Chengwen
Xu, Wenyuan
ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 5204 - 5209
[4] Knowledge-based recurrent neural networks in reinforcement learning
Le, Tien Dung
Komeda, Takashi
Takagi, Motoki
PROCEEDINGS OF THE 11TH IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, 2007, : 169 - 174
[5] Multi-Agent Reinforcement Learning Based UAV Swarm Communications Against Jamming
Lv, Zefang
Xiao, Liang
Du, Yousong
Niu, Guohang
Xing, Chengwen
Xu, Wenyuan
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (12) : 9063 - 9075
[6] A Dyna-Q-Based Solution for UAV Networks Against Smart Jamming Attacks
Li, Zhiwei
Lu, Yu
Shi, Yun
Wang, Zengguang
Qiao, Wenxin
Liu, Yicen
SYMMETRY-BASEL, 2019, 11 (05):
[7] Comparing Knowledge-Based Reinforcement Learning to Neural Networks in a Strategy Game
Nechepurenko, Liudmyla
Voss, Viktor
Gritsenko, Vyacheslav
HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2020, 2020, 12344 : 312 - 328
[8] Distributed reinforcement learning based framework for energy-efficient UAV relay against jamming
Wang W.
Lv Z.
Lu X.
Zhang Y.
Xiao L.
Intelligent and Converged Networks, 2021, 2 (02) : 150 - 162
[9] UAV-AIDED CELLULAR COMMUNICATIONS WITH DEEP REINFORCEMENT LEARNING AGAINST JAMMING
Lu, Xiaozhen
Xiao, Liang
Dai, Canhuang
Dai, Huaiyu
IEEE WIRELESS COMMUNICATIONS, 2020, 27 (04) : 48 - 53
[10] UAV-Aided Cellular Communications with Deep Reinforcement Learning against Jamming
Lu, Xiaozhen
Xiao, Liang
Dai, Canhuang
Dai, Huaiyu
IEEE Wireless Communications, 2020, 27 (04) : 48 - 53
