Maneuver Decision-Making for Autonomous Air Combat Based on FRE-PPO

Cited by: 9
Authors
Zhang, Hongpeng [1]
Wei, Yujie [1]
Zhou, Huan [1]
Huang, Changqiang [1]
Affiliations
[1] Air Force Engn Univ, Aeronaut Engn Coll, Xian 710038, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2022 / Volume 12 / Issue 20
Funding
National Natural Science Foundation of China
Keywords
autonomous air combat; maneuver decision-making; reinforcement learning; final reward estimation; proximal policy optimization; GO; LEVEL; GAME;
DOI
10.3390/app122010230
Chinese Library Classification (CLC) number
O6 [Chemistry]
Subject classification code
0703
Abstract
Maneuver decision-making is the core of autonomous air combat, and reinforcement learning is a promising approach to decision-making problems. However, when reinforcement learning is applied to maneuver decision-making for autonomous air combat, it often suffers from low training efficiency and poor decision-making performance. This paper proposes an air combat maneuver decision-making method based on final reward estimation and proximal policy optimization to address these problems. First, an air combat environment based on aircraft and missile models is constructed, and an intermediate reward and a final reward are designed. Second, final reward estimation is proposed to replace the original advantage estimation function in the surrogate objective of proximal policy optimization, improving the training performance of reinforcement learning. Third, sampling according to the final reward estimation is proposed to improve training efficiency. Finally, the proposed method is used in a self-play framework to train agents for maneuver decision-making. Simulations show that final reward estimation and sampling according to final reward estimation are effective and efficient.
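The abstract does not give the formula for final reward estimation, so the following is only a rough sketch of the idea of swapping the advantage term in the standard PPO clipped surrogate objective for a signal derived from the final reward. The estimator used here (copying each episode's final reward to every step of that episode) and the names `clipped_surrogate` and `broadcast_final_reward` are assumptions made for illustration, not the paper's method.

```python
import numpy as np


def clipped_surrogate(new_logp, old_logp, signal, clip_eps=0.2):
    """Standard PPO clipped surrogate objective (to be maximized).

    `signal` weights each sample. In standard PPO it is the advantage
    estimate; here it may instead be a final reward estimate (assumption).
    """
    ratio = np.exp(np.asarray(new_logp) - np.asarray(old_logp))
    unclipped = ratio * signal
    clipped = np.clip(ratio, 1.0 - clip_eps, 1.0 + clip_eps) * signal
    # Take the pessimistic (smaller) of the two terms per sample.
    return float(np.mean(np.minimum(unclipped, clipped)))


def broadcast_final_reward(final_rewards, episode_lengths):
    """Hypothetical estimator: assign each episode's final reward
    (e.g. +1 for a win, -1 for a loss) to all steps of that episode."""
    return np.repeat(np.asarray(final_rewards, dtype=float), episode_lengths)


# Two episodes: a win lasting 2 steps and a loss lasting 3 steps.
signal = broadcast_final_reward([1.0, -1.0], [2, 3])
logp = np.zeros(5)  # identical old and new policies, so every ratio is 1
objective = clipped_surrogate(logp, logp, signal)
```

With identical old and new policies every probability ratio is 1, so the objective reduces to the mean of the per-step signal. The abstract's third step, sampling according to the final reward estimation, is not sketched here because its details are not stated.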
Pages: 18