Constraints Driven Safe Reinforcement Learning for Autonomous Driving Decision-Making

被引:0
|
作者
Gao, Fei [1 ,2 ]
Wang, Xiaodong [1 ]
Fan, Yuze [1 ]
Gao, Zhenhai [1 ,2 ]
Zhao, Rui [1 ]
机构
[1] Jilin Univ, Coll Automot Engn, Changchun 130025, Peoples R China
[2] Jilin Univ, Natl Key Lab Automot Chassis Integrat & Bion, Changchun 130025, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
基金
美国国家科学基金会;
关键词
Autonomous vehicles; Safety; Road transportation; Decision making; Planning; Measurement; Accuracy; Autonomous driving; Reinforcement learning; constrained policy optimization; reinforcement learning;
D O I
10.1109/ACCESS.2024.3454249
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although reinforcement learning (RL) methodologies exhibit potential in addressing decision-making and planning problems in autonomous driving, ensuring the safety of the vehicle under all circumstances remains a formidable challenge in practical applications. Current RL methods are predominantly driven by singular reward mechanisms, frequently encountering difficulties in balancing multiple sub-rewards such as safety, comfort, and efficiency. To address these limitations, this paper introduces a constraint-driven safety RL method, applied to decision-making and planning policy in highway scenarios. This method ensures decisions maximize performance rewards within the bounds of safety constraints, exhibiting exceptional robustness. Initially, the framework reformulates the autonomous driving decision-making problem as a Constrained Markov Decision Process (CMDP) within the safety RL framework. It then introduces a Multi-Level Safety-Constrained Policy Optimization (MLSCPO) method, incorporating a cost function to address safety constraints. Ultimately, simulated tests conducted within the CARLA environment demonstrate that the proposed method MLSCPO outperforms the current advanced safe reinforcement learning policy, Proximal Policy Optimization with Lagrangian (PPO-Lag) and the traditional stable longitudinal and lateral autonomous driving model, Intelligent Driver Model with Minimization of Overall Braking Induced by Lane Changes (IDM+MOBIL). Compared to the classic IDM+MOBIL method, the proposed approach not only achieves efficient driving but also offers a better driving experience. In comparison with the reinforcement learning method PPO-Lag, it significantly enhances safety while ensuring driving efficiency, achieving a zero-collision rate. In the future, we will integrate the aforementioned potential expansion plans to enhance the usability and generalization capabilities of the method in real-world applications.
引用
收藏
页码:128007 / 128023
页数:17
相关论文
共 50 条
  • [1] Towards Robust Decision-Making for Autonomous Highway Driving Based on Safe Reinforcement Learning
    Zhao, Rui
    Chen, Ziguo
    Fan, Yuze
    Li, Yun
    Gao, Fei
    SENSORS, 2024, 24 (13)
  • [2] Tactical Decision-Making in Autonomous Driving by Reinforcement Learning with Uncertainty Estimation
    Hoel, Carl-Johan
    Wolff, Krister
    Laine, Leo
    2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020, : 1563 - 1569
  • [3] A Safe and Efficient Lane Change Decision-Making Strategy of Autonomous Driving Based on Deep Reinforcement Learning
    Lv, Kexuan
    Pei, Xiaofei
    Chen, Ci
    Xu, Jie
    MATHEMATICS, 2022, 10 (09)
  • [4] Reinforcement Learning Based Overtaking Decision-Making for Highway Autonomous Driving
    Li, Xin
    Xu, Xin
    Zuo, Lei
    2015 SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2015, : 336 - 342
  • [5] Deep Reinforcement Learning Enabled Decision-Making for Autonomous Driving at Intersections
    Guofa Li
    Shenglong Li
    Shen Li
    Yechen Qin
    Dongpu Cao
    Xingda Qu
    Bo Cheng
    Automotive Innovation, 2020, 3 : 374 - 385
  • [6] Deep Reinforcement Learning Enabled Decision-Making for Autonomous Driving at Intersections
    Li, Guofa
    Li, Shenglong
    Li, Shen
    Qin, Yechen
    Cao, Dongpu
    Qu, Xingda
    Cheng, Bo
    AUTOMOTIVE INNOVATION, 2020, 3 (04) : 374 - 385
  • [7] Review of Autonomous Driving Decision-Making Research Based on Reinforcement Learning
    Jin L.
    Han G.
    Xie X.
    Guo B.
    Liu G.
    Zhu W.
    Qiche Gongcheng/Automotive Engineering, 2023, 45 (04): : 527 - 540
  • [8] A Decision-making Method for Longitudinal Autonomous Driving Based on Inverse Reinforcement Learning
    Gao Z.
    Yan X.
    Gao F.
    Qiche Gongcheng/Automotive Engineering, 2022, 44 (07): : 969 - 975
  • [9] Random Prior Network for Autonomous Driving Decision-Making Based on Reinforcement Learning
    Qiang, Yuchuan
    Wang, Xiaolan
    Wang, Yansong
    Zhang, Weiwei
    Xu, Jianxun
    JOURNAL OF TRANSPORTATION ENGINEERING PART A-SYSTEMS, 2024, 150 (04)
  • [10] Towards Safe Autonomous Driving: Decision Making with Observation-Robust Reinforcement Learning
    Xiangkun He
    Chen Lv
    Automotive Innovation, 2023, 6 : 509 - 520