An Intelligent Strategy Decision Method for Collaborative Jamming Based on Hierarchical Multi-Agent Reinforcement Learning

被引:0
|
作者
Zhang, Wenxu [1 ,2 ,3 ]
Zhao, Tong [1 ,2 ,3 ]
Zhao, Zhongkai [1 ,2 ,3 ]
Wang, Yajie [1 ,2 ,3 ]
Liu, Feiran [4 ]
机构
[1] Harbin Engn Univ, Coll Informat & Commun Engn, Minist Ind & Informat Technol, Harbin 150001, Heilongjiang, Peoples R China
[2] Harbin Engn Univ, Key Lab Adv Marine Commun & Informat Technol, Minist Ind & Informat Technol, Harbin 150001, Heilongjiang, Peoples R China
[3] Harbin Engn Univ, AVIC United Technol Ctr Electromagnet Spectrum Col, Harbin 150001, Heilongjiang, Peoples R China
[4] Wright State Univ, Dept Elect Engn, Dayton, OH 45435 USA
关键词
Jamming; Decision making; Radar; Frequency diversity; Reinforcement learning; Training; Time-frequency analysis; Cooperative jamming decision-making; resource allocation; hierarchical reinforcement learning; multi-agent reinforcement learning; prioritized experience replay; WAVE-FORM; RADAR; ALLOCATION;
D O I
10.1109/TCCN.2024.3373640
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Aiming at the problem of intelligent cooperative jamming decision-making against frequency agility and frequency diversity in cognitive electronic warfare, an intelligent cooperative jamming strategy decision-making method based on hierarchical multi-agent reinforcement learning is proposed. The multi-agent markov decision process (MDP) is used to construct the multi-jammer cooperative decision-making process. The cooperative jamming decision-making in frequency domain (FD-CJDM) model is established. The design idea of hierarchical reinforcement learning (HRL) is introduced. In order to find the optimal strategy, a double-depth Q-network based on the prioritized experience replay (PER-DDQN) optimization method of sum tree structure is adopted. The performance of FD-CJDM model based on PER-DDQN is simulated. Simulation results show that the proposed PER-DDQN method is obviously superior to deep Q network (DQN) method in action estimation, and its convergence performance is faster than that of double-depth Q network (DDQN). In addition, the intelligent decision-making method of cooperative jamming proposed in this paper can fomulate the frequency domain parameter decision-making strategy according to the order of real-time detected radar threats, which effectively realizes the design of intelligent decision-making in frequency domain.
引用
收藏
页码:1467 / 1480
页数:14
相关论文
共 50 条
  • [1] An Intelligent Strategy Decision Method for Collaborative Jamming Based On Hierarchical Multi-Agent Reinforcement Learning
    Zhang, Wenxu
    Zhao, Tong
    Zhao, Zhongkai
    Wang, Yajie
    Liu, Feiran
    IEEE Transactions on Cognitive Communications and Networking, 1600, (1-1):
  • [2] A cooperative jamming decision-making method based on multi-agent reinforcement learning
    Bingchen Cai
    Haoran Li
    Naimin Zhang
    Mingyu Cao
    Han Yu
    Autonomous Intelligent Systems, 5 (1):
  • [3] Hierarchical Architecture for Multi-Agent Reinforcement Learning in Intelligent Game
    Li, Bin
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [4] An Intelligent Financial Investment Decision Model Based on Multi-Agent Reinforcement Learning
    Guo, Qi
    Li, Bohan
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2024, 33 (15)
  • [5] Hierarchical multi-agent reinforcement learning
    Mohammad Ghavamzadeh
    Sridhar Mahadevan
    Rajbala Makar
    Autonomous Agents and Multi-Agent Systems, 2006, 13 : 197 - 229
  • [6] Collaborative Intelligent Reflecting Surface Networks With Multi-Agent Reinforcement Learning
    Zhang, Jie
    Li, Jun
    Zhang, Yijin
    Wu, Qingqing
    Wu, Xiongwei
    Shu, Feng
    Jin, Shi
    Chen, Wen
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (03) : 532 - 545
  • [7] Hierarchical multi-agent reinforcement learning
    Ghavamzadeh, Mohammad
    Mahadevan, Sridhar
    Makar, Rajbala
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2006, 13 (02) : 197 - 229
  • [8] Collaborative Intelligent Reflecting Surface Networks With Multi-Agent Reinforcement Learning
    Zhang, Jie
    Li, Jun
    Zhang, Yijin
    Wu, Qingqing
    Wu, Xiongwei
    Shu, Feng
    Jin, Shi
    Chen, Wen
    IEEE Journal on Selected Topics in Signal Processing, 2022, 16 (03): : 532 - 545
  • [9] Hierarchical Multi-Agent Training Based on Reinforcement Learning
    Wang, Guanghua
    Li, Wenjie
    Wu, Zhanghua
    Guo, Xian
    2024 9TH ASIA-PACIFIC CONFERENCE ON INTELLIGENT ROBOT SYSTEMS, ACIRS, 2024, : 11 - 18
  • [10] A method of network attack-defense game and collaborative defense decision-making based on hierarchical multi-agent reinforcement learning
    Tang, Yunlong
    Sun, Jing
    Wang, Huan
    Deng, Junyi
    Tong, Liang
    Xu, Wenhong
    COMPUTERS & SECURITY, 2024, 142