Human as AI mentor: Enhanced human-in-the-loop reinforcement learning for safe and efficient autonomous driving

被引:1
|
作者
Huang, Zilin [1 ]
Sheng, Zihao [1 ]
Ma, Chengyuan [1 ]
Chen, Sikai [1 ]
机构
[1] Univ Wisconsin Madison, Dept Civil & Environm Engn, Madison, WI 53706 USA
关键词
Human as AI mentor paradigm; Autonomous driving; Deep reinforcement learning; Human -in -the -loop learning; Driving policy; Mixed traffic platoon; MODEL;
D O I
10.1016/j.commtr.2024.100127
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
Despite significant progress in autonomous vehicles (AVs), the development of driving policies that ensure both the safety of AVs and traffic flow efficiency has not yet been fully explored. In this paper, we propose an enhanced human-in-the-loop reinforcement learning method, termed the Human as AI mentor-based deep reinforcement learning (HAIM-DRL) framework, which facilitates safe and efficient autonomous driving in mixed traffic platoon. Drawing inspiration from the human learning process, we first introduce an innovative learning paradigm that effectively injects human intelligence into AI, termed Human as AI mentor (HAIM). In this paradigm, the human expert serves as a mentor to the AI agent. While allowing the agent to sufficiently explore uncertain environments, the human expert can take control in dangerous situations and demonstrate correct actions to avoid potential accidents. On the other hand, the agent could be guided to minimize traffic flow disturbance, thereby optimizing traffic flow efficiency. In detail, HAIM-DRL leverages data collected from free exploration and partial human demonstrations as its two training sources. Remarkably, we circumvent the intricate process of manually designing reward functions; instead, we directly derive proxy state-action values from partial human demonstrations to guide the agents' policy learning. Additionally, we employ a minimal intervention technique to reduce the human mentor's cognitive load. Comparative results show that HAIM-DRL outperforms traditional methods in driving safety, sampling efficiency, mitigation of traffic flow disturbance, and generalizability to unseen traffic scenarios.
引用
收藏
页数:24
相关论文
共 50 条
  • [41] Safe Q-Learning Approaches for Human-in-Loop Reinforcement Learning
    Veerabathraswamy, Swathi
    Bhatt, Nirav
    2023 NINTH INDIAN CONTROL CONFERENCE, ICC, 2023, : 16 - 21
  • [42] Human-in-the-Loop Behavior Modeling via an Integral Concurrent Adaptive Inverse Reinforcement Learning
    Wu, Huai-Ning
    Wang, Mi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) : 11359 - 11370
  • [43] Active Learning for Human-in-the-Loop Customs Inspection
    Kim, Sundong
    Mai, Tung-Duong
    Han, Sungwon
    Park, Sungwon
    Nguyen, D. K. Thi
    So, Jaechan
    Singh, Karandeep
    Cha, Meeyoung
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (12) : 12039 - 12052
  • [44] HELIX: Accelerating Human-in-the-loop Machine Learning
    Xin, Doris
    Ma, Litian
    Liu, Jialin
    Macke, Stephen
    Song, Shuchen
    Parameswaran, Aditya
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (12): : 1958 - 1961
  • [45] Continual learning classification method with human-in-the-loop
    Liu, Jia
    Li, Dong
    Shan, Wangweiyi
    Liu, Shulin
    METHODSX, 2023, 11
  • [46] Human-in-the-loop Learning for Dynamic Congestion Games
    Li H.
    Duan L.
    IEEE Transactions on Mobile Computing, 2024, 23 (12) : 1 - 12
  • [47] Human-in-the-loop machine learning: a state of the art
    Mosqueira-Rey, Eduardo
    Hernandez-Pereira, Elena
    Alonso-Rios, David
    Bobes-Bascaran, Jose
    Fernandez-Leal, Angel
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (04) : 3005 - 3054
  • [48] Human-in-the-Loop Low-Shot Learning
    Wan, Sen
    Hou, Yimin
    Bao, Feng
    Ren, Zhiquan
    Dong, Yunfeng
    Dai, Qionghai
    Deng, Yue
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (07) : 3287 - 3292
  • [49] Human-in-the-Loop Learning With LLMs for Efficient RASE Tagging in Building Compliance Regulations
    Al-Turki, Dhoyazan
    Hettiarachchi, Hansi
    Medhat Gaber, Mohamed
    Abdelsamea, Mohammed M.
    Basurra, Shadi
    Iranmanesh, Sima
    Saadany, Hadeel
    Vakaj, Edlira
    IEEE Access, 2024, 12 : 185291 - 185306
  • [50] Human-in-the-loop machine learning: a state of the art
    Eduardo Mosqueira-Rey
    Elena Hernández-Pereira
    David Alonso-Ríos
    José Bobes-Bascarán
    Ángel Fernández-Leal
    Artificial Intelligence Review, 2023, 56 : 3005 - 3054