UAV Networks Against Multiple Maneuvering Smart Jamming With Knowledge-Based Reinforcement Learning

被引:22
|
作者
Li, Zhiwei [1 ]
Lu, Yu [2 ]
Li, Xi [2 ]
Wang, Zengguang [3 ]
Qiao, Wenxin [2 ]
Liu, Yicen [2 ]
机构
[1] Army Engn Univ, UAV Engn Dept, Shijiazhuang Campus, Shijiazhuang 050003, Hebei, Peoples R China
[2] Aircraft Maintenance Ctr, Shijiazhuang Campus, Yongji 044500, Peoples R China
[3] Natl Def Univ, Shijiazhuang 050000, Hebei, Peoples R China
来源
IEEE INTERNET OF THINGS JOURNAL | 2021年 / 8卷 / 15期
关键词
Jamming; Interference; Reinforcement learning; Games; Receivers; Signal to noise ratio; Convergence; Anti-jamming; domain knowledge; reinforcement learning (RL); unmanned aerial vehicle (UAV) networks; STACKELBERG GAME; TRANSMISSION;
D O I
10.1109/JIOT.2021.3062659
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The unmanned aerial vehicles (UAVs) networks are very vulnerable to smart jammers that can choose their jamming strategy based on the ongoing channel state accordingly. Although reinforcement learning (RL) algorithms can give UAV networks the ability to make intelligent decisions, the high-dimensional state space makes it difficult for algorithms to converge quickly. This article proposes a knowledge-based RL method, which uses domain knowledge to compress the state space that the agent needs to explore and then improve the algorithm convergence speed. Specifically, we use the inertial law of the aircraft and the law of signal attenuation in free space to guide the highly efficient exploration of the UAVs in the state space. We incorporate the performance indicators of the receiver and the subjective value of the task into the design of the reward function, and build a virtual environment for pretraining to accelerate the convergence of anti-jamming decisions. In addition, the algorithm proposed is completely based on observable data, which is more realistic than those studies that assume the position or the channel strategy of the jammer. The simulation shows that the proposed algorithm can outperform the benchmarks of model-free RL algorithm in terms of converge speed and averaged reward.
引用
收藏
页码:12289 / 12310
页数:22
相关论文
共 50 条
  • [1] UAV Relay in VANETs Against Smart Jamming With Reinforcement Learning
    Xiao, Liang
    Lu, Xiaozhen
    Xu, Dongjin
    Tang, Yuliang
    Wang, Lei
    Zhuang, Weihua
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (05) : 4087 - 4097
  • [2] Reinforcement Learning Based UAV Swarm Communications Against Jamming
    Lv, Zefang
    Niu, Guohang
    Xiao, Liang
    Xing, Chengwen
    Xu, Wenyuan
    IEEE International Conference on Communications, 2023, 2023-May : 5204 - 5209
  • [3] Reinforcement Learning based UAV Swarm Communications Against Jamming
    Lv, Zefang
    Niu, Guohang
    Xiao, Liang
    Xing, Chengwen
    Xu, Wenyuan
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 5204 - 5209
  • [4] Knowledge-based recurrent neural networks in reinforcement learning
    Le, Tien Dung
    Komeda, Takashi
    Takagi, Motoki
    PROCEDINGS OF THE 11TH IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, 2007, : 169 - 174
  • [5] Multi-Agent Reinforcement Learning Based UAV Swarm Communications Against Jamming
    Lv, Zefang
    Xiao, Liang
    Du, Yousong
    Niu, Guohang
    Xing, Chengwen
    Xu, Wenyuan
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (12) : 9063 - 9075
  • [6] A Dyna-Q-Based Solution for UAV Networks Against Smart Jamming Attacks
    Li, Zhiwei
    Lu, Yu
    Shi, Yun
    Wang, Zengguang
    Qiao, Wenxin
    Liu, Yicen
    SYMMETRY-BASEL, 2019, 11 (05):
  • [7] Comparing Knowledge-Based Reinforcement Learning to Neural Networks in a Strategy Game
    Nechepurenko, Liudmyla
    Voss, Viktor
    Gritsenko, Vyacheslav
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2020, 2020, 12344 : 312 - 328
  • [8] Distributed reinforcement learning based framework for energy-efficient UAV relay against jamming
    Wang W.
    Lv Z.
    Lu X.
    Zhang Y.
    Xiao L.
    Intelligent and Converged Networks, 2021, 2 (02): : 150 - 162
  • [9] UAV-AIDED CELLULAR COMMUNICATIONS WITH DEEP REINFORCEMENT LEARNING AGAINST JAMMING
    Lu, Xiaozhen
    Xiao, Liang
    Dai, Canhuang
    Dai, Huaiyu
    IEEE WIRELESS COMMUNICATIONS, 2020, 27 (04) : 48 - 53
  • [10] UAV-Aided Cellular Communications with Deep Reinforcement Learning against Jamming
    Lu, Xiaozhen
    Xiao, Liang
    Dai, Canhuang
    Dai, Huaiyu
    IEEE Wireless Communications, 2020, 27 (04): : 48 - 53