Human as AI mentor: Enhanced human-in-the-loop reinforcement learning for safe and efficient autonomous driving

被引:1
|
作者
Huang, Zilin [1 ]
Sheng, Zihao [1 ]
Ma, Chengyuan [1 ]
Chen, Sikai [1 ]
机构
[1] Univ Wisconsin Madison, Dept Civil & Environm Engn, Madison, WI 53706 USA
关键词
Human as AI mentor paradigm; Autonomous driving; Deep reinforcement learning; Human -in -the -loop learning; Driving policy; Mixed traffic platoon; MODEL;
D O I
10.1016/j.commtr.2024.100127
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Despite significant progress in autonomous vehicles (AVs), the development of driving policies that ensure both the safety of AVs and traffic flow efficiency has not yet been fully explored. In this paper, we propose an enhanced human-in-the-loop reinforcement learning method, termed the Human as AI mentor-based deep reinforcement learning (HAIM-DRL) framework, which facilitates safe and efficient autonomous driving in mixed traffic platoon. Drawing inspiration from the human learning process, we first introduce an innovative learning paradigm that effectively injects human intelligence into AI, termed Human as AI mentor (HAIM). In this paradigm, the human expert serves as a mentor to the AI agent. While allowing the agent to sufficiently explore uncertain environments, the human expert can take control in dangerous situations and demonstrate correct actions to avoid potential accidents. On the other hand, the agent could be guided to minimize traffic flow disturbance, thereby optimizing traffic flow efficiency. In detail, HAIM-DRL leverages data collected from free exploration and partial human demonstrations as its two training sources. Remarkably, we circumvent the intricate process of manually designing reward functions; instead, we directly derive proxy state-action values from partial human demonstrations to guide the agents' policy learning. Additionally, we employ a minimal intervention technique to reduce the human mentor's cognitive load. Comparative results show that HAIM-DRL outperforms traditional methods in driving safety, sampling efficiency, mitigation of traffic flow disturbance, and generalizability to unseen traffic scenarios.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Toward Human-in-the-Loop AI: Enhancing Deep Reinforcement Learning via Real-Time Human Guidance for Autonomous Driving
    Wu, Jingda
    Huang, Zhiyu
    Hu, Zhongxu
    Lv, Chen
    ENGINEERING, 2023, 21 : 75 - 91
  • [2] Safety-Aware Human-in-the-Loop Reinforcement Learning With Shared Control for Autonomous Driving
    Huang, Wenhui
    Liu, Haochen
    Huang, Zhiyu
    Lv, Chen
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, : 16181 - 16192
  • [3] Human-in-the-loop Reinforcement Learning
    Liang, Huanghuang
    Yang, Lu
    Cheng, Hong
    Tu, Wenzhe
    Xu, Mengjie
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 4511 - 4518
  • [4] Autonomous Driving Traffic Control Based on Human-in-the-loop Decisions
    Pang, Yan
    Zhang, Gaochang
    Xia, Hao
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 1116 - 1121
  • [5] Enabling Autonomous Medical Image Data Annotation: A human-in-the-loop Reinforcement Learning Approach
    da Cruz, Leonardo C.
    Sierra-Franco, Cesar A.
    Silva-Calpa, Greis Francy M.
    Raposo, Alberto Barbosa
    PROCEEDINGS OF THE 2021 16TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2021, : 271 - 279
  • [6] Value Driven Representation for Human-in-the-Loop Reinforcement Learning
    Keramati, Ramtin
    Brunskill, Emma
    ACM UMAP '19: PROCEEDINGS OF THE 27TH ACM CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION, 2019, : 176 - 180
  • [7] Reinforcement Learning Requires Human-in-the-Loop Framing and Approaches
    Taylor, Matthew E.
    HHAI 2023: AUGMENTING HUMAN INTELLECT, 2023, 368 : 351 - 360
  • [8] Where to Add Actions in Human-in-the-Loop Reinforcement Learning
    Mandel, Travis
    Liu, Yun-En
    Brunskill, Emma
    Popovic, Zoran
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2322 - 2328
  • [9] Human-in-the-Loop Learning Methods Toward Safe DL-Based Autonomous Systems: A Review
    Rajendran, Prajit T.
    Espinoza, Huascar
    Delaborde, Agnes
    Mraidha, Chokri
    COMPUTER SAFETY, RELIABILITY, AND SECURITY (SAFECOMP 2021), 2021, 12853 : 251 - 264
  • [10] ASHA: Assistive Teleoperation via Human-in-the-Loop Reinforcement Learning
    Chen, Sean
    Gao, Jensen
    Reddy, Siddharth
    Berseth, Glen
    Dragan, Anca D.
    Levine, Sergey
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 7505 - 7512