Research on Target Defense Strategy Based on Deep Reinforcement Learning

被引:3
|
作者
Luo, Yuelin [1 ]
Gang, Tieqiang [1 ]
Chen, Lijie [1 ]
机构
[1] Xiamen Univ, Sch Aerosp Engn, Xiamen 361000, Peoples R China
关键词
Games; Reinforcement learning; Real-time systems; Markov processes; Deep learning; Heuristic algorithms; Differential games; Targeting; ADT game; target defense problem; deep reinforcement learning; DDPG; PURSUIT;
D O I
10.1109/ACCESS.2022.3179373
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Considering the natural advantages of deep reinforcement learning algorithms in dealing with continuous control problems, especially for dynamic interactions, these algorithms can be applied to solve the Attacker-Defender-Target (ADT) game problem. In this paper, the deep deterministic policy gradient (DDPG) and the multiagent DDPG algorithm are employed to solve the issue of target defense in the ADT game. By introducing an angle between the attacker-target line of sight and the attacker-defender line, we modify the reward function in the deep reinforcement learning algorithm, and redefine the corresponding state space and action space. Through several numerical experiments, the validity of the modified reward function is obvious that the modified defender's reward function improves the defender's strategic performance in the game. Compared with the traditional differential game theory, the DDPG and multiagent DDPG algorithms with the modified reward function can realize real-time decision-making and improve the flexibility of defenders in the confrontation process.
引用
收藏
页码:82329 / 82335
页数:7
相关论文
共 50 条
  • [21] Optimal Defense Strategy Selection Algorithm Based on Reinforcement Learning and Opposition-Based Learning
    Yue, Yiqun
    Zhou, Yang
    Xu, Lijuan
    Zhao, Dawei
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [22] Impact of Subjectivity in Deep Reinforcement Learning based Defense of Cloud Storage
    Aref, Zahra
    Mandayam, Narayan B.
    [J]. IEEE INFOCOM 2022 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2022,
  • [23] Research on carbon asset trading strategy based on PSO-VMD and deep reinforcement learning
    Zhang, Jiayang
    Chen, Kaijie
    [J]. JOURNAL OF CLEANER PRODUCTION, 2024, 435
  • [24] Maritime platform defense with deep reinforcement learning
    Markowitz, Jared
    Sheffield, Ryan
    Mullins, Galen
    [J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS IV, 2022, 12113
  • [25] Research on Retrieval Ranking Based on Deep Reinforcement Learning
    Zhang, Kun
    Lin, Min
    Li, Yanling
    [J]. 2019 6TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE 2019), 2019, : 504 - 510
  • [26] Ensemble Strategy Based on Deep Reinforcement Learning for Portfolio Optimization
    Su, Xiao
    Zhou, Yalan
    He, Shanshan
    Li, Xiangxia
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT IV, KSEM 2023, 2023, 14120 : 242 - 249
  • [27] Control strategy of hydraulic cylinder based on Deep Reinforcement Learning
    Wyrwal, Daniel
    Lindner, Tymoteusz
    Nowak, Patryk
    Bialek, Marcin
    [J]. 15TH INTERNATIONAL CONFERENCE MECHATRONIC SYSTEMS AND MATERIALS, MSM'20, 2020, : 169 - 173
  • [28] A Control Strategy of Autonomous Vehicles based on Deep Reinforcement Learning
    Xia, Wei
    Li, Huiyun
    Li, Baopu
    [J]. PROCEEDINGS OF 2016 9TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2016, : 198 - 201
  • [29] Bitcoin transaction strategy construction based on deep reinforcement learning
    Liu, Fengrui
    Li, Yang
    Li, Baitong
    Li, Jiaxin
    Xie, Huiyang
    [J]. APPLIED SOFT COMPUTING, 2021, 113
  • [30] Deep-reinforcement-learning-based water diversion strategy
    Jiang, Qingsong
    Li, Jincheng
    Sun, Yanxin
    Huang, Jilin
    Zou, Rui
    Ma, Wenjing
    Guo, Huaicheng
    Wang, Zhiyun
    Liu, Yong
    [J]. ENVIRONMENTAL SCIENCE AND ECOTECHNOLOGY, 2024, 17