Subtask-masked curriculum learning for reinforcement learning with application to UAV maneuver decision-making

被引:3
|
作者
Hou, Yueqi [1 ,2 ]
Liang, Xiaolong [1 ,2 ]
Lv, Maolong [1 ]
Yang, Qisong [3 ]
Li, Yang [3 ]
机构
[1] Air Force Engn Univ, Air Traff Control & Nav Sch, Xian, Peoples R China
[2] Air Force Engn Univ, Shaanxi Key Lab Meta Synth Elect & Informat Syst, Xian, Peoples R China
[3] Delft Univ Technol, Fac Elect Engn Math & Comp Sci, Delft, Netherlands
关键词
Unmanned Aerial Vehicle; Maneuver decision-making; Reinforcement learning; Curriculum learning; Knowledge transfer; STRATEGY;
D O I
10.1016/j.engappai.2023.106703
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unmanned Aerial Vehicle (UAV) maneuver strategy learning remains a challenge when using Reinforcement Learning (RL) in this sparse reward task. In this paper, we propose Subtask-Masked curriculum learning for RL (SubMas-RL), an efficient RL paradigm that implements curriculum learning and knowledge transfer for UAV maneuver scenarios involving multiple missiles. First, this study introduces a novel concept known as subtask mask to create source tasks from a target task by masking partial subtasks. Then, a subtask-masked curriculum generation method is proposed to generate a sequenced curriculum by alternately conducting task generation and task sequencing. To establish efficient knowledge transfer and avoid negative transfer, this paper employs two transfer techniques, policy distillation and policy reuse, along with an explicit transfer condition that masks irrelevant knowledge. Experimental results demonstrate that our method achieves a 94.8% success rate in the UAV maneuver scenario, where the direct use of reinforcement learning always fails. The proposed RL framework SubMas-RL is expected to learn an effective policy in complex tasks with sparse rewards.
引用
收藏
页数:14
相关论文
共 50 条
  • [11] UAV swarm air combat maneuver decision-making method based on multi-agent reinforcement learning and transferring
    Zhiqiang ZHENG
    Chen WEI
    Haibin DUAN
    [J]. Science China(Information Sciences), 2024, 67 (08) - 66
  • [12] Reinforcement learning with hierarchical decision-making
    Cohen, Shahar
    Maimon, Oded
    Khmlenitsky, Evgeni
    [J]. ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 3, 2006, : 177 - +
  • [13] Decision analysis and reinforcement learning in surgical decision-making
    Loftus, Tyler J.
    Filiberto, Amanda C.
    Li, Yanjun
    Balch, Jeremy
    Cook, Allyson C.
    Tighe, Patrick J.
    Efron, Philip A.
    Upchurch, Gilbert R., Jr.
    Rashidi, Parisa
    Li, Xiaolin
    Bihorac, Azra
    [J]. SURGERY, 2020, 168 (02) : 253 - 266
  • [14] REINFORCEMENT LEARNING FOR DECISION-MAKING IN A BUSINESS SIMULATOR
    Garcia, Javier
    Borrajo, Fernando
    Fernandez, Fernando
    [J]. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2012, 11 (05) : 935 - 960
  • [15] Application of Reinforcement Learning in Decision-Making Management of Intelligent Unmanned System
    Wei, Ning
    Wang, Guan
    [J]. Binggong Xuebao/Acta Armamentarii, 2022, 43 : 164 - 169
  • [16] Transformer in reinforcement learning for decision-making: a survey
    Yuan, Weilin
    Chen, Jiaxing
    Chen, Shaofei
    Feng, Dawei
    Hu, Zhenzhen
    Li, Peng
    Zhao, Weiwei
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2024, 25 (06) : 763 - 790
  • [17] Collaborative Decision-Making Method for Multi-UAV Based on Multiagent Reinforcement Learning
    Li, Shaowei
    Jia, Yuhong
    Yang, Fan
    Qin, Qingyang
    Gao, Hui
    Zhou, Yaoming
    [J]. IEEE ACCESS, 2022, 10 : 91385 - 91396
  • [18] Autonomous maneuver decision-making method based on reinforcement learning and Monte Carlo tree search
    Zhang, Hongpeng
    Zhou, Huan
    Wei, Yujie
    Huang, Changqiang
    [J]. FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [19] Intelligent Maneuver Decision Method of UAV based on Reinforcement Learning and Neural Network
    Thou, Huan
    Zhang, Senyu
    Sun, Chu
    Ru, Changjian
    [J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 8544 - 8549
  • [20] Autonomous Maneuver Decision Making of Dual-UAV Cooperative Air Combat Based on Deep Reinforcement Learning
    Hu, Jinwen
    Wang, Luhe
    Hu, Tianmi
    Guo, Chubing
    Wang, Yanxiong
    [J]. ELECTRONICS, 2022, 11 (03)