Subtask-masked curriculum learning for reinforcement learning with application to UAV maneuver decision-making

被引:3
|
作者
Hou, Yueqi [1 ,2 ]
Liang, Xiaolong [1 ,2 ]
Lv, Maolong [1 ]
Yang, Qisong [3 ]
Li, Yang [3 ]
机构
[1] Air Force Engn Univ, Air Traff Control & Nav Sch, Xian, Peoples R China
[2] Air Force Engn Univ, Shaanxi Key Lab Meta Synth Elect & Informat Syst, Xian, Peoples R China
[3] Delft Univ Technol, Fac Elect Engn Math & Comp Sci, Delft, Netherlands
关键词
Unmanned Aerial Vehicle; Maneuver decision-making; Reinforcement learning; Curriculum learning; Knowledge transfer; STRATEGY;
D O I
10.1016/j.engappai.2023.106703
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unmanned Aerial Vehicle (UAV) maneuver strategy learning remains a challenge when using Reinforcement Learning (RL) in this sparse reward task. In this paper, we propose Subtask-Masked curriculum learning for RL (SubMas-RL), an efficient RL paradigm that implements curriculum learning and knowledge transfer for UAV maneuver scenarios involving multiple missiles. First, this study introduces a novel concept known as subtask mask to create source tasks from a target task by masking partial subtasks. Then, a subtask-masked curriculum generation method is proposed to generate a sequenced curriculum by alternately conducting task generation and task sequencing. To establish efficient knowledge transfer and avoid negative transfer, this paper employs two transfer techniques, policy distillation and policy reuse, along with an explicit transfer condition that masks irrelevant knowledge. Experimental results demonstrate that our method achieves a 94.8% success rate in the UAV maneuver scenario, where the direct use of reinforcement learning always fails. The proposed RL framework SubMas-RL is expected to learn an effective policy in complex tasks with sparse rewards.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] A UAV Autonomous Maneuver Decision-Making Algorithm for Route Guidance
    Zhang, Kun
    Li, Ke
    He, Jianliang
    Shi, Haotian
    Wang, Yongting
    Niu, Chen
    [J]. 2020 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS'20), 2020, : 17 - 25
  • [32] Negotiable Reinforcement Learning for Pareto Optimal Sequential Decision-Making
    Desai, Nishant
    Critch, Andrew
    Russell, Stuart
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [33] Research on Decision-Making in Emotional Agent Based on Reinforcement Learning
    Feng Chao
    Chen Lin
    Jiang Kui
    Wei Zhonglin
    Zhai Bing
    [J]. 2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 1191 - 1194
  • [34] Unveiling the Decision-Making Process in Reinforcement Learning with Genetic Programming
    Eberhardinger, Manuel
    Rupp, Florian
    Maucher, Johannes
    Maghsudi, Setareh
    [J]. ADVANCES IN SWARM INTELLIGENCE, PT I, ICSI 2024, 2024, 14788 : 349 - 365
  • [35] Intrusion Response Decision-making Method Based on Reinforcement Learning
    Yang, Jun-nan
    Zhang, Hong-qi
    Zhang, Chuan-fu
    [J]. 2018 INTERNATIONAL CONFERENCE ON COMMUNICATION, NETWORK AND ARTIFICIAL INTELLIGENCE (CNAI 2018), 2018, : 154 - 162
  • [36] Reinforcement Learning with Uncertainty Estimation for Tactical Decision-Making in Intersections
    Hoel, Carl-Johan
    Tram, Tommy
    Sjoberg, Jonas
    [J]. 2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [37] SPACECRAFT DECISION-MAKING AUTONOMY USING DEEP REINFORCEMENT LEARNING
    Harris, Andrew
    Teil, Thibaud
    Schaub, Hanspeter
    [J]. SPACEFLIGHT MECHANICS 2019, VOL 168, PTS I-IV, 2019, 168 : 1757 - 1775
  • [38] MONEYBARL: EXPLOITING PITCHER DECISION-MAKING USING REINFORCEMENT LEARNING
    Sidhu, Gagan
    Caffo, Brian
    [J]. ANNALS OF APPLIED STATISTICS, 2014, 8 (02): : 926 - 955
  • [39] Cognitive Reinforcement Learning: An Interpretable Decision-Making for Virtual Driver
    Qi, Hao
    Hou, Enguang
    Ye, Peijun
    [J]. IEEE JOURNAL OF RADIO FREQUENCY IDENTIFICATION, 2024, 8 : 627 - 631
  • [40] Decision-making models on perceptual uncertainty with distributional reinforcement learning
    Xu, Shuyuan
    Liu, Qiao
    Hu, Yuhui
    Xu, Mengtian
    Hao, Jiachen
    [J]. GREEN ENERGY AND INTELLIGENT TRANSPORTATION, 2023, 2 (02):