Autonomous Maneuver Decision-Making Through Curriculum Learning and Reinforcement Learning With Sparse Rewards

Cited by: 0
Authors
Wei, Yujie [1, 2]
Zhang, Hongpeng [1 ]
Wang, Yuan [1 ]
Huang, Changqiang [1 ]
Affiliations
[1] Air Force Engn Univ, Inst Aeronaut Engn, Xian 710038, Peoples R China
[2] Air Force Xian Flying Coll, Xian 710300, Peoples R China
Keywords
Maneuver decision-making; curriculum learning; reinforcement learning; sparse rewards; ALGORITHMS; NETWORKS
DOI
10.1109/ACCESS.2023.3297095
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Reinforcement learning is an effective approach to decision-making problems. However, when it is applied to maneuver decision-making with sparse rewards, training takes too long and the final performance may be unsatisfactory. To overcome these shortcomings, a method for maneuver decision-making based on curriculum learning and reinforcement learning is proposed. First, three curricula are designed for the maneuver decision-making problem: an angle curriculum, a distance curriculum, and a hybrid curriculum. They follow the intuition that closer destinations are easier to reach. The curricula are then used to train agents and are compared with the original method without any curriculum. The training results show that the angle curriculum increases the speed and stability of training and improves maneuver decision-making performance; the distance curriculum increases the speed and stability of agent training; and the hybrid curriculum is no better than the other two, because it causes the agent to get stuck in a local optimum. The simulation results show that, after training, the agent can handle situations in which targets approach from different directions, and its maneuver decisions are rational, effective, and interpretable, whereas the method without a curriculum is ineffective.
Pages: 73543-73555
Page count: 13
Related papers
50 in total
  • [1] Maneuver Decision-Making through Automatic Curriculum Reinforcement Learning without Handcrafted Reward Functions
    Wei, Yujie
    Zhang, Hongpeng
    Wang, Yuan
    Huang, Changqiang
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (16)
  • [2] Subtask-masked curriculum learning for reinforcement learning with application to UAV maneuver decision-making
    Hou, Yueqi
    Liang, Xiaolong
    Lv, Maolong
    Yang, Qisong
    Li, Yang
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 125
  • [3] A UAV Maneuver Decision-Making Algorithm for Autonomous Airdrop Based on Deep Reinforcement Learning
    Li, Ke
    Zhang, Kun
    Zhang, Zhenchong
    Liu, Zekun
    Hua, Shuai
    He, Jianliang
    [J]. SENSORS, 2021, 21 (6)
  • [4] Autonomous maneuver decision-making method based on reinforcement learning and Monte Carlo tree search
    Zhang, Hongpeng
    Zhou, Huan
    Wei, Yujie
    Huang, Changqiang
    [J]. FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [5] UAVs Maneuver Decision-Making Method Based on Transfer Reinforcement Learning
    Zhu, Jindong
    Fu, Xiaowei
    Qiao, Zhe
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022: 2399796
  • [6] Research on Air Confrontation Maneuver Decision-Making Method Based on Reinforcement Learning
    Zhang, Xianbing
    Liu, Guoqing
    Yang, Chaojie
    Wu, Jiang
    [J]. ELECTRONICS, 2018, 7 (11)
  • [7] Tactical Decision-Making in Autonomous Driving by Reinforcement Learning with Uncertainty Estimation
    Hoel, Carl-Johan
    Wolff, Krister
    Laine, Leo
    [J]. 2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020: 1563-1569
  • [8] A Decision-Making Method for Autonomous Vehicles Based on Simulation and Reinforcement Learning
    Zheng, Rui
    Liu, Chunming
    Guo, Qi
    [J]. PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013: 362-369
  • [9] Constraints Driven Safe Reinforcement Learning for Autonomous Driving Decision-Making
    Gao, Fei
    Wang, Xiaodong
    Fan, Yuze
    Gao, Zhenhai
    Zhao, Rui
    [J]. IEEE ACCESS, 2024, 12: 128007-128023
  • [10] Reinforcement Learning Based Overtaking Decision-Making for Highway Autonomous Driving
    Li, Xin
    Xu, Xin
    Zuo, Lei
    [J]. 2015 SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2015: 336-342