Deep Reinforcement Learning With Application to Air Confrontation Intelligent Decision-Making of Manned/Unmanned Aerial Vehicle Cooperative System

被引:23
|
作者
Li, Yue [1 ]
Han, Wei [1 ]
Wang, Yongqing [2 ]
机构
[1] Naval Aviat Univ, Coll Basic Sci Aviat, Yantai 264001, Peoples R China
[2] Shenyang Aircraft Design & Res Inst, Shenyang 110035, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷 / 08期
基金
中国国家自然科学基金;
关键词
Decision making; Cooperative systems; Mathematical model; Reinforcement learning; Atmospheric modeling; Optimization; Approximation algorithms; Manned; unmanned aerial vehicle; intelligent decision-making; application of deep reinforcement learning; intention guiding; deep deterministic policy gradient; self-learning; UAV;
D O I
10.1109/ACCESS.2020.2985576
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the development of intelligence in air confrontation, the demand for cooperative engagement of manned/unmanned aerial vehicle (MAV/UAV) is becoming more intense. Deep reinforcement learning (DRL), which combines the abstract representation capability of deep learning (DL) and the optimal decision-making and control capability of reinforcement learning (RL), is an appropriate application for dealing with this problem. In the case of continuous action space, the dynamics model of UAV and the basic structure of one of the most popular DRL methods called deep deterministic policy gradient (DDPG) are built firstly. To establish the framework of intelligent decision-making of MAV/UAV, typical intentions including Head-on attack, Fleeing, Pursuing and Energy-storing, corresponding to four optimization models, are introduced secondly. Then the neural network is trained by means of reconstructing the replay buffer of DDPG algorithm. Finally, simulation results show that UAV is able to learn intelligent decision-making throughout the intention guiding of MAV. Compared with original DDPG algorithm, the improved method can achieve a better performance in convergence and stability. Furthermore, the level of intelligent decision-making in air confrontation can be improved by self-learning.
引用
收藏
页码:67887 / 67898
页数:12
相关论文
共 50 条
  • [1] Application of Reinforcement Learning in Decision-Making Management of Intelligent Unmanned System
    Wei N.
    Wang G.
    [J]. Binggong Xuebao/Acta Armamentarii, 2022, 43 : 164 - 169
  • [2] Decision-making allocation method in manned/unmanned combat aerial vehicle cooperative engagement
    Zhong Y.
    Yao P.
    Sun Y.
    [J]. 1600, Systems Engineering Society of China (36): : 2984 - 2992
  • [3] Interactive cooperative decision-making of manned-unmanned aerial vehicle combat system based on HFCM
    Zhong Y.
    Yao P.
    Zhang J.
    Wu J.
    [J]. Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2021, 41 (10): : 2748 - 2760
  • [4] Unmanned Aerial Vehicle Swarm Cooperative Decision-Making for SEAD Mission: A Hierarchical Multiagent Reinforcement Learning Approach
    Yue, Longfei
    Yang, Rennong
    Zuo, Jialiang
    Zhang, Ying
    Li, Qiuni
    Zhang, Yijie
    [J]. IEEE ACCESS, 2022, 10 : 92177 - 92191
  • [5] Multi-Unmanned Aerial Vehicle Confrontation in Intelligent Air Combat: A Multi-Agent Deep Reinforcement Learning Approach
    Yang, Jianfeng
    Yang, Xinwei
    Yu, Tianqi
    [J]. DRONES, 2024, 8 (08)
  • [6] Intelligent route planning for cooperative striking of manned/unmanned aerial vehicle
    Luo W.-E.
    Wei R.-X.
    [J]. Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2019, 36 (07): : 1090 - 1095
  • [7] Cooperative mission control system for a manned vehicle and unmanned aerial vehicle
    Peng, Hui
    Xiang, Xiaojia
    Wu, Lizhen
    Zhu, Huayong
    [J]. Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2008, 29 (SUPPL.): : 135 - 141
  • [8] Application of Reinforcement Learning in Multiagent Intelligent Decision-Making
    Han, Xiaoyu
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [9] Intelligent Decision-making Modeling for Unmanned Aerial Vehicle Based on Probability Graphical Models
    Shi, Zhifu
    Guo, Yaohua
    Li, Yaxiong
    [J]. 2012 IEEE FIFTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2012, : 304 - 308
  • [10] Research on Air Confrontation Maneuver Decision-Making Method Based on Reinforcement Learning
    Zhang, Xianbing
    Liu, Guoqing
    Yang, Chaojie
    Wu, Jiang
    [J]. ELECTRONICS, 2018, 7 (11):