An Intelligent Strategy Decision Method for Collaborative Jamming Based on Hierarchical Multi-Agent Reinforcement Learning

被引:0
|
作者
Zhang, Wenxu [1 ,2 ,3 ]
Zhao, Tong [1 ,2 ,3 ]
Zhao, Zhongkai [1 ,2 ,3 ]
Wang, Yajie [1 ,2 ,3 ]
Liu, Feiran [4 ]
机构
[1] Harbin Engn Univ, Coll Informat & Commun Engn, Minist Ind & Informat Technol, Harbin 150001, Heilongjiang, Peoples R China
[2] Harbin Engn Univ, Key Lab Adv Marine Commun & Informat Technol, Minist Ind & Informat Technol, Harbin 150001, Heilongjiang, Peoples R China
[3] Harbin Engn Univ, AVIC United Technol Ctr Electromagnet Spectrum Col, Harbin 150001, Heilongjiang, Peoples R China
[4] Wright State Univ, Dept Elect Engn, Dayton, OH 45435 USA
关键词
Jamming; Decision making; Radar; Frequency diversity; Reinforcement learning; Training; Time-frequency analysis; Cooperative jamming decision-making; resource allocation; hierarchical reinforcement learning; multi-agent reinforcement learning; prioritized experience replay; WAVE-FORM; RADAR; ALLOCATION;
D O I
10.1109/TCCN.2024.3373640
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Aiming at the problem of intelligent cooperative jamming decision-making against frequency agility and frequency diversity in cognitive electronic warfare, an intelligent cooperative jamming strategy decision-making method based on hierarchical multi-agent reinforcement learning is proposed. The multi-agent markov decision process (MDP) is used to construct the multi-jammer cooperative decision-making process. The cooperative jamming decision-making in frequency domain (FD-CJDM) model is established. The design idea of hierarchical reinforcement learning (HRL) is introduced. In order to find the optimal strategy, a double-depth Q-network based on the prioritized experience replay (PER-DDQN) optimization method of sum tree structure is adopted. The performance of FD-CJDM model based on PER-DDQN is simulated. Simulation results show that the proposed PER-DDQN method is obviously superior to deep Q network (DQN) method in action estimation, and its convergence performance is faster than that of double-depth Q network (DDQN). In addition, the intelligent decision-making method of cooperative jamming proposed in this paper can fomulate the frequency domain parameter decision-making strategy according to the order of real-time detected radar threats, which effectively realizes the design of intelligent decision-making in frequency domain.
引用
收藏
页码:1467 / 1480
页数:14
相关论文
共 50 条
  • [21] UAV intelligent attack strategy generation model based on multi-agent game reinforcement learning
    Zhao Z.
    Cao L.
    Chen X.
    Lai J.
    Zhang L.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2023, 45 (10): : 3165 - 3171
  • [22] Multi-Agent Reinforcement Learning Based UAV Swarm Communications Against Jamming
    Lv, Zefang
    Xiao, Liang
    Du, Yousong
    Niu, Guohang
    Xing, Chengwen
    Xu, Wenyuan
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (12) : 9063 - 9075
  • [23] Intelligent Spectrum Sensing and Access With Partial Observation Based on Hierarchical Multi-Agent Deep Reinforcement Learning
    Li, Xuanheng
    Zhang, Yulong
    Ding, Haichuan
    Fang, Yuguang
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (04) : 3131 - 3145
  • [24] Constraint-based multi-agent reinforcement learning for collaborative tasks
    Shang, Xiumin
    Xu, Tengyu
    Karamouzas, Ioannis
    Kallmann, Marcelo
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2023, 34 (3-4)
  • [25] Multi-agent Collaborative Fire Rescue Based on Deep Reinforcement Learning
    Feng, Yiming
    2022 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, BIG DATA AND ALGORITHMS (EEBDA), 2022, : 1317 - 1321
  • [26] Hierarchical reinforcement learning based on multi-agent cooperation game theory
    Tang H.
    Dong C.
    International Journal of Wireless and Mobile Computing, 2019, 16 (04): : 369 - 376
  • [27] A multi-agent reinforcement learning anti-jamming method with partially overlapping channels
    Zhang, Yunpeng
    Jia, Luliang
    Qi, Nan
    Xu, Yifan
    Chen, Xueqiang
    IET COMMUNICATIONS, 2021, 15 (19) : 2461 - 2468
  • [28] Avoiding collaborative paradox in multi-agent reinforcement learning
    Kim, Hyunseok
    Kim, Seonghyun
    Lee, Donghun
    Jang, Ingook
    ETRI JOURNAL, 2021, 43 (06) : 1004 - 1012
  • [29] Studies on hierarchical reinforcement learning in multi-agent environment
    Yu Lasheng
    Marin, Alonso
    Hong Fei
    Lin Jian
    PROCEEDINGS OF 2008 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL, VOLS 1 AND 2, 2008, : 1714 - 1720
  • [30] Multi-Agent Hierarchical Reinforcement Learning with Dynamic Termination
    Han, Dongge
    Boehmer, Wendelin
    Wooldridge, Michael
    Rogers, Alex
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2006 - 2008