Autonomous Maneuver Decision-Making Through Curriculum Learning and Reinforcement Learning With Sparse Rewards

被引:0
|
作者
Wei, Yujie [1 ,2 ]
Zhang, Hongpeng [1 ]
Wang, Yuan [1 ]
Huang, Changqiang [1 ]
机构
[1] Air Force Engn Univ, Inst Aeronaut Engn, Xian 710038, Peoples R China
[2] Air Force Xian Flying Coll, Xian 710300, Peoples R China
关键词
Maneuver decision-making; curriculum learning; reinforcement learning; sparse rewards; ALGORITHMS; NETWORKS;
D O I
10.1109/ACCESS.2023.3297095
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning is an effective approach for solving decision-making problems. However, when using reinforcement learning to solve maneuver decision-making with sparse rewards, it costs too much time for training, and the final performance may not be satisfactory. In order to overcome the shortcomings, the method for maneuver decision-making based on curriculum learning and reinforcement learning is proposed. First, three curricula are designed to address the maneuver decision-making problem: angle curriculum, distance curriculum and hybrid curriculum. They are proposed according to the intuition that closer destinations are easier to arrive at. Then, they are used to train agents and compared with the original method without any curriculum. The training results show that angle curriculum can increase the speed and stability of training, and improve the performance of maneuver decision-making; distance curriculum can increase the speed and stability of agent training; hybrid curriculum is not better than the other curricula, because it makes the agent get stuck at the local optimum. The simulation results show that after training, the agent can handle the situations where targets come from different directions, and the maneuver decision-makings are rational, effective, and interpretable, whereas the method without curriculum is invalid.
引用
收藏
页码:73543 / 73555
页数:13
相关论文
共 50 条
  • [21] Review of Autonomous Driving Decision-Making Research Based on Reinforcement Learning
    Jin L.
    Han G.
    Xie X.
    Guo B.
    Liu G.
    Zhu W.
    [J]. Qiche Gongcheng/Automotive Engineering, 2023, 45 (04): : 527 - 540
  • [22] Reinforcement learning with hierarchical decision-making
    Cohen, Shahar
    Maimon, Oded
    Khmlenitsky, Evgeni
    [J]. ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 3, 2006, : 177 - +
  • [23] Intermittent Reinforcement Learning with Sparse Rewards
    Sahoo, Prachi Pratyusha
    Vamvoudakis, Kyriakos G.
    [J]. 2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 2709 - 2714
  • [24] Decision-Making in Fallback Scenarios for Autonomous Vehicles: Deep Reinforcement Learning Approach
    Lee, Cheonghwa
    An, Dawn
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (22):
  • [25] Autonomous Decision-Making for Aerobraking via Parallel Randomized Deep Reinforcement Learning
    Falcone, Giusy
    Putnam, Zachary R. R.
    [J]. IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2023, 59 (03) : 3055 - 3070
  • [26] A Decision-Making Model for Autonomous Vehicles at Intersections Based on Hierarchical Reinforcement Learning
    Chen, Xue-Mei
    Xu, Shu-Yuan
    Wang, Zi-Jia
    Zheng, Xue-Long
    Han, Xin-Tong
    Liu, En-Hao
    [J]. UNMANNED SYSTEMS, 2024, 12 (04) : 641 - 652
  • [27] Decision-Making Strategy on Highway for Autonomous Vehicles Using Deep Reinforcement Learning
    Liao, Jiangdong
    Liu, Teng
    Tang, Xiaolin
    Mu, Xingyu
    Huang, Bing
    Cao, Dongpu
    [J]. IEEE ACCESS, 2020, 8 (08): : 177804 - 177814
  • [28] A Decision-making Method for Longitudinal Autonomous Driving Based on Inverse Reinforcement Learning
    Gao Z.
    Yan X.
    Gao F.
    [J]. Qiche Gongcheng/Automotive Engineering, 2022, 44 (07): : 969 - 975
  • [29] Random Prior Network for Autonomous Driving Decision-Making Based on Reinforcement Learning
    Qiang, Yuchuan
    Wang, Xiaolan
    Wang, Yansong
    Zhang, Weiwei
    Xu, Jianxun
    [J]. JOURNAL OF TRANSPORTATION ENGINEERING PART A-SYSTEMS, 2024, 150 (04)
  • [30] Research on decision-making of autonomous vehicle following based on reinforcement learning method
    Gao, Hongbo
    Shi, Guanya
    Wang, Kelong
    Xie, Guotao
    Liu, Yuchao
    [J]. INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2019, 46 (03): : 444 - 452