Efficient Sampling-Based Maximum Entropy Inverse Reinforcement Learning With Application to Autonomous Driving

被引:65
|
作者
Wu, Zheng [1 ]
Sun, Liting [1 ]
Zhan, Wei [1 ]
Yang, Chenyu [2 ]
Tomizuka, Masayoshi [1 ]
机构
[1] Univ Calif Berkeley, Dept Mech Engn, Berkeley, CA 94709 USA
[2] Shanghai Jiao Tong Univ, Dept Comp Sci, Shanghai 200240, Peoples R China
关键词
Learning from demonstration; intelligent transportation systems; inverse reinforcement learning; autonomous driving; social human-robot interaction; ALGORITHMS;
D O I
10.1109/LRA.2020.3005126
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In the past decades, we have witnessed significant progress in the domain of autonomous driving. Advanced techniques based on optimization and reinforcement learning become increasingly powerful when solving the forward problem: given designed reward/cost functions, how we should optimize them and obtain driving policies that interact with the environment safely and efficiently. Such progress has raised another equally important question: what should we optimize? Instead of manually specifying the reward functions, it is desired that we can extract what human drivers try to optimize from real traffic data and assign that to autonomous vehicles to enable more naturalistic and transparent interaction between humans and intelligent agents. To address this issue, we present an efficient sampling-based maximum-entropy inverse reinforcement learning (IRL) algorithm in this letter. Different from existing IRL algorithms, by introducing an efficient continuous-domain trajectory sampler, the proposed algorithm can directly learn the reward functions in the continuous domain while considering the uncertainties in demonstrated trajectories from human drivers. We evaluate the proposed algorithm via real-world driving data, including both non-interactive and interactive scenarios. The experimental results show that the proposed algorithm achieves more accurate prediction performance with faster convergence speed and better generalization compared to other baseline IRL algorithms.
引用
收藏
页码:5355 / 5362
页数:8
相关论文
共 50 条
  • [41] An Efficient Sampling-Based Path Planning for the Lunar Rover with Autonomous Target Seeking
    Chen, Gang
    You, Hong
    Huang, Zeyuan
    Fei, Junting
    Wang, Yifan
    Liu, Chuankai
    AEROSPACE, 2022, 9 (03)
  • [42] Safe reinforcement learning with mixture density network, with application to autonomous driving
    Baheri, Ali
    RESULTS IN CONTROL AND OPTIMIZATION, 2022, 6
  • [43] A Comprehensive Survey on the Application of Deep and Reinforcement Learning Approaches in Autonomous Driving
    Ben Elallid, Badr
    Benamar, Nabil
    Hafid, Abdelhakim Senhaji
    Rachidi, Tajjeeddine
    Mrani, Nabil
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (09) : 7366 - 7390
  • [44] A Behavior Decision Method Based on Reinforcement Learning for Autonomous Driving
    Zheng, Kan
    Yang, Haojun
    Liu, Shiwen
    Zhang, Kuan
    Lei, Lei
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (24) : 25386 - 25394
  • [45] PLASTR: Planning for Autonomous Sampling-Based Trowelling
    Kuhlmann-Jorgensen, Mads A.
    Pankert, Johannes
    Pietrasik, Lukasz L.
    Hutter, Marco
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (08) : 5069 - 5076
  • [46] An open framework for human-like autonomous driving using Inverse Reinforcement Learning
    Vasquez, Dizan
    Yu, Yufeng
    Kumar, Suryansh
    Laugier, Christian
    2014 IEEE VEHICLE POWER AND PROPULSION CONFERENCE (VPPC), 2014,
  • [47] Uncertainty-Aware Model-Based Reinforcement Learning: Methodology and Application in Autonomous Driving
    Wu, Jingda
    Huang, Zhiyu
    Lv, Chen
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (01): : 194 - 203
  • [48] Sampling-Based Trajectory Repairing for Autonomous Vehicles
    Lin, Yuanfei
    Maierhofer, Sebastian
    Althoff, Matthias
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 572 - 579
  • [49] A direct sampling-based deep learning approach for inverse medium scattering problems
    Ning, Jianfeng
    Han, Fuqun
    Zou, Jun
    INVERSE PROBLEMS, 2024, 40 (01)
  • [50] Adaptive sampling-based motion planning with a non-conservatively defensive strategy for autonomous driving
    Li, Zhaoting
    Zhan, Wei
    Sun, Liting
    Chan, Ching-Yao
    Tomizuka, Masayoshi
    IFAC PAPERSONLINE, 2020, 53 (02): : 15632 - 15638