Autonomous Maneuver Decision-Making Through Curriculum Learning and Reinforcement Learning With Sparse Rewards

被引:0
|
作者
Wei, Yujie [1 ,2 ]
Zhang, Hongpeng [1 ]
Wang, Yuan [1 ]
Huang, Changqiang [1 ]
机构
[1] Air Force Engn Univ, Inst Aeronaut Engn, Xian 710038, Peoples R China
[2] Air Force Xian Flying Coll, Xian 710300, Peoples R China
关键词
Maneuver decision-making; curriculum learning; reinforcement learning; sparse rewards; ALGORITHMS; NETWORKS;
D O I
10.1109/ACCESS.2023.3297095
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning is an effective approach for solving decision-making problems. However, when using reinforcement learning to solve maneuver decision-making with sparse rewards, it costs too much time for training, and the final performance may not be satisfactory. In order to overcome the shortcomings, the method for maneuver decision-making based on curriculum learning and reinforcement learning is proposed. First, three curricula are designed to address the maneuver decision-making problem: angle curriculum, distance curriculum and hybrid curriculum. They are proposed according to the intuition that closer destinations are easier to arrive at. Then, they are used to train agents and compared with the original method without any curriculum. The training results show that angle curriculum can increase the speed and stability of training, and improve the performance of maneuver decision-making; distance curriculum can increase the speed and stability of agent training; hybrid curriculum is not better than the other curricula, because it makes the agent get stuck at the local optimum. The simulation results show that after training, the agent can handle the situations where targets come from different directions, and the maneuver decision-makings are rational, effective, and interpretable, whereas the method without curriculum is invalid.
引用
收藏
页码:73543 / 73555
页数:13
相关论文
共 50 条
  • [31] Decision analysis and reinforcement learning in surgical decision-making
    Loftus, Tyler J.
    Filiberto, Amanda C.
    Li, Yanjun
    Balch, Jeremy
    Cook, Allyson C.
    Tighe, Patrick J.
    Efron, Philip A.
    Upchurch, Gilbert R., Jr.
    Rashidi, Parisa
    Li, Xiaolin
    Bihorac, Azra
    [J]. SURGERY, 2020, 168 (02) : 253 - 266
  • [32] Augmenting Reinforcement Learning With Transformer-Based Scene Representation Learning for Decision-Making of Autonomous Driving
    Liu, Haochen
    Huang, Zhiyu
    Mo, Xiaoyu
    Lv, Chen
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (03): : 4405 - 4421
  • [33] Autonomous Industrial Management via Reinforcement Learning Towards Self-Learning Agents for Decision-Making
    Espinosa-Leal, Leonardo
    Chapman, Anthony
    Westerlund, Magnus
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (06) : 8427 - 8439
  • [34] REINFORCEMENT LEARNING FOR DECISION-MAKING IN A BUSINESS SIMULATOR
    Garcia, Javier
    Borrajo, Fernando
    Fernandez, Fernando
    [J]. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2012, 11 (05) : 935 - 960
  • [35] Transformer in reinforcement learning for decision-making: a survey
    Yuan, Weilin
    Chen, Jiaxing
    Chen, Shaofei
    Feng, Dawei
    Hu, Zhenzhen
    Li, Peng
    Zhao, Weiwei
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2024, 25 (06) : 763 - 790
  • [36] Deep-Reinforcement-Learning-Based Autonomous UAV Navigation With Sparse Rewards
    Wang, Chao
    Wang, Jian
    Wang, Jingjing
    Zhang, Xudong
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (07): : 6180 - 6190
  • [37] A Deep Reinforcement Learning Decision-Making Approach for Adaptive Cruise Control in Autonomous Vehicles
    Ghraizi, Dany
    Talj, Reine
    Francis, Clovis
    [J]. 2023 21ST INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS, ICAR, 2023, : 71 - 78
  • [38] Multi-intent autonomous decision-making for air combat with deep reinforcement learning
    Luyu Jia
    Chengtao Cai
    Xingmei Wang
    Zhengkun Ding
    Junzheng Xu
    Kejun Wu
    Jiaqi Liu
    [J]. Applied Intelligence, 2023, 53 : 29076 - 29093
  • [39] An Autonomous Attack Decision-Making Method Based on Hierarchical Virtual Bayesian Reinforcement Learning
    Wang, Dinghan
    Zhang, Jiandong
    Yang, Qiming
    Liu, Jieling
    Shi, Guoqing
    Zhang, Yaozhong
    [J]. IEEE Transactions on Aerospace and Electronic Systems, 2024, 60 (05) : 7075 - 7088
  • [40] Knowledge-Embedded Deep Reinforcement Learning for Autonomous Network Decision-Making Algorithm
    Zhang, Yalin
    Gao, Hui
    Su, Xin
    Liu, Bei
    [J]. 2022 IEEE 95TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-SPRING), 2022,