Human as AI mentor: Enhanced human-in-the-loop reinforcement learning for safe and efficient autonomous driving

被引:1
|
作者
Huang, Zilin [1 ]
Sheng, Zihao [1 ]
Ma, Chengyuan [1 ]
Chen, Sikai [1 ]
机构
[1] Univ Wisconsin Madison, Dept Civil & Environm Engn, Madison, WI 53706 USA
关键词
Human as AI mentor paradigm; Autonomous driving; Deep reinforcement learning; Human -in -the -loop learning; Driving policy; Mixed traffic platoon; MODEL;
D O I
10.1016/j.commtr.2024.100127
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Despite significant progress in autonomous vehicles (AVs), the development of driving policies that ensure both the safety of AVs and traffic flow efficiency has not yet been fully explored. In this paper, we propose an enhanced human-in-the-loop reinforcement learning method, termed the Human as AI mentor-based deep reinforcement learning (HAIM-DRL) framework, which facilitates safe and efficient autonomous driving in mixed traffic platoon. Drawing inspiration from the human learning process, we first introduce an innovative learning paradigm that effectively injects human intelligence into AI, termed Human as AI mentor (HAIM). In this paradigm, the human expert serves as a mentor to the AI agent. While allowing the agent to sufficiently explore uncertain environments, the human expert can take control in dangerous situations and demonstrate correct actions to avoid potential accidents. On the other hand, the agent could be guided to minimize traffic flow disturbance, thereby optimizing traffic flow efficiency. In detail, HAIM-DRL leverages data collected from free exploration and partial human demonstrations as its two training sources. Remarkably, we circumvent the intricate process of manually designing reward functions; instead, we directly derive proxy state-action values from partial human demonstrations to guide the agents' policy learning. Additionally, we employ a minimal intervention technique to reduce the human mentor's cognitive load. Comparative results show that HAIM-DRL outperforms traditional methods in driving safety, sampling efficiency, mitigation of traffic flow disturbance, and generalizability to unseen traffic scenarios.
引用
收藏
页数:24
相关论文
共 50 条
  • [31] A survey of human-in-the-loop for machine learning
    Wu, Xingjiao
    Xiao, Luwei
    Sun, Yixuan
    Zhang, Junhang
    Ma, Tianlong
    He, Liang
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 135 : 364 - 381
  • [32] Human-in-the-loop Applied Machine Learning
    Brodley, Carla E.
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 1 - 1
  • [33] Reinforcement Learning Control of Robotic Knee With Human-in-the-Loop by Flexible Policy Iteration
    Gao, Xiang
    Si, Jennie
    Wen, Yue
    Li, Minhan
    Huang, He
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5873 - 5887
  • [34] Artificial Swarm Intelligence, a Human-in-the-Loop Approach to AI
    Rosenberg, Louis
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 4381 - 4382
  • [35] Constructing Ethical AI Based on the "Human-in-the-Loop" System
    Chen, Ximeng
    Wang, Xiaohong
    Qu, Yanzhang
    SYSTEMS, 2023, 11 (11):
  • [36] Human-in-the-Loop AI Reviewing: Feasibility, Opportunities, and Risks
    Drori, Iddo
    Te'eni, Dov
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SYSTEMS, 2024, 25 (01): : 98 - 109
  • [37] Human-in-the-loop for Bayesian autonomous materials phase mapping
    Adams, Felix
    McDannald, Austin
    Takeuchi, Ichiro
    Kusne, Gilad
    MATTER, 2024, 7 (02) : 697 - 709
  • [38] Synthesis of Human-in-the-Loop Control Protocols for Autonomous Systems
    Feng, Lu
    Wiltsche, Clemens
    Humphrey, Laura
    Topcu, Ufuk
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2016, 13 (02) : 450 - 462
  • [39] Modeling Operator Performance in Human-in-the-Loop Autonomous Systems
    Muhammad, Shahabuddin
    IEEE ACCESS, 2021, 9 : 102715 - 102731
  • [40] Safe Reinforcement Learning in Autonomous Driving With Epistemic Uncertainty Estimation
    Zhang, Zheng
    Liu, Qi
    Li, Yanjie
    Lin, Ke
    Li, Linyu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, : 1 - 14