Beyond-Visual-Range Air Combat Tactics Auto-Generation by Reinforcement Learning

被引:21
|
作者
Piao, Haiyin [1 ]
Sun, Zhixiao [2 ]
Meng, Guanglei [3 ]
Chen, Hechang [4 ]
Qu, Bohao [4 ]
Lang, Kuijun [5 ]
Sun, Yang [5 ]
Yang, Shengqi [5 ]
Peng, Xuanqi [5 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian, Peoples R China
[2] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian, Peoples R China
[3] Shenyang Aerosp Univ, Sch Automat, Shenyang, Peoples R China
[4] Jilin Univ, Sch Artificial Intelligence, Changchun, Peoples R China
[5] SADRI Inst, Dept Elect Syst, Shenyang, Peoples R China
关键词
air combat; reinforcement learning; LEVEL; GAME; GO;
D O I
10.1109/ijcnn48605.2020.9207088
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For quite a long time, effective Beyond-Visual-Range (BVR) air combat tactics can only be discovered by human pilots in the actual combat process. However, due to the lack of actual combat opportunities, making new air combat tactics innovation was generally considered quite difficult. To address this challenge, we first introduced a solely end-to-end Reinforcement Learning (RL) approach for training competitive air combat agents with adversarial self-play from scratch in a high fidelity air combat simulation environment during training. Furthermore, a Key Air Combat Event Reward Shaping (KAERS) mechanism was proposed to provide sparse but objective shaped rewards beyond episodic win/lose signal to accelerate the initial machine learning process. Experimental results showed that multiple valuable air combat tactical behaviors emerged progressively. We hope this study could be extended to the future of air combat machine intelligence research.
引用
下载
收藏
页数:8
相关论文
共 50 条
  • [21] A Multi-UCAV cooperative occupation method based on weapon engagement zones for beyond-visual-range air combat
    Wei-hua Li
    Jing-ping Shi
    Yun-yan Wu
    Yue-ping Wang
    Yong-xi Lyu
    Defence Technology, 2022, 18 (06) : 1006 - 1022
  • [22] A Multi-UCAV cooperative occupation method based on weapon engagement zones for beyond-visual-range air combat
    Li, Wei-hua
    Shi, Jing-ping
    Wu, Yun-yan
    Wang, Yue-ping
    Lyu, Yong-xi
    DEFENCE TECHNOLOGY, 2022, 18 (06) : 1006 - 1022
  • [23] Machine Learning to Improve Situational Awareness in Beyond Visual Range Air Combat
    Dantas, Joao P. A.
    Maximo, Marcos R. O. A.
    Costa, Andre N.
    Geraldo, Diego
    Yoneyama, Takashi
    IEEE LATIN AMERICA TRANSACTIONS, 2022, 20 (08) : 2039 - 2045
  • [24] A Multi-UCAV Cooperative Decision-Making Method Based on an MAPPO Algorithm for Beyond-Visual-Range Air Combat
    Liu, Xiaoxiong
    Yin, Yi
    Su, Yuzhan
    Ming, Ruichen
    AEROSPACE, 2022, 9 (10)
  • [25] Mixed-Reward Multiagent Proximal Policy Optimization Method for Two-on-Two Beyond-Visual-Range Air Combat
    Peng, Haojie
    Li, Weihua
    Dai, Sifan
    Chen, Ruihai
    IEEE CANADIAN JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2024,
  • [26] Evasive Maneuver Strategy for UCAV in Beyond-Visual-Range Air Combat Based on Hierarchical Multi-Objective Evolutionary Algorithm
    Yang, Zhen
    Zhou, Deyun
    Piao, Haiyin
    Zhang, Kai
    Kong, Weiren
    Pan, Qian
    IEEE ACCESS, 2020, 8 : 46605 - 46623
  • [27] A Multi-Target Consensus-Based Auction Algorithm for Distributed Target Assignment in Cooperative Beyond-Visual-Range Air Combat
    Li, Weihua
    Lyu, Yongxi
    Dai, Sifan
    Chen, Huakun
    Shi, Jingping
    Li, Yongfeng
    AEROSPACE, 2022, 9 (09)
  • [28] Engagement Decision Support for Beyond Visual Range Air Combat
    Dantas, Joao P. A.
    Costa, Andre N.
    Geraldo, Diego
    Maximo, Marcos R. O. A.
    Yoneyama, Takashi
    2021 LATIN AMERICAN ROBOTICS SYMPOSIUM / 2021 BRAZILIAN SYMPOSIUM ON ROBOTICS / 2021 WORKSHOP OF ROBOTICS IN EDUCATION (LARS-SBR-WRE 2021), 2021, : 96 - 101
  • [29] Auto-generation and configuration of visual modeling language editor
    School of Computer Science and Technology, Beijing University of Aeronautics and Astronautics, Beijing 100083, China
    Beijing Hangkong Hangtian Daxue Xuebao, 2006, 12 (1495-1498):
  • [30] Cross coordination of behavior clone and reinforcement learning for autonomous within-visual-range air combat
    Li, Lun
    Zhang, Xuebo
    Qian, Chenxu
    Zhao, Minghui
    Wang, Runhua
    NEUROCOMPUTING, 2024, 584