Merging in Congested Freeway Traffic Using Multipolicy Decision Making and Passive Actor-Critic Learning

被引:43
|
作者
Nishi, Tomoki [1 ,2 ]
Doshi, Prashant [3 ]
Prokhorov, Danil [1 ]
机构
[1] Toyota R&D, Ann Arbor, MI 48105 USA
[2] Toyota Cent Res & Dev Labs Inc, Nagakute, Aichi 4801192, Japan
[3] Univ Georgia, Dept Comp Sci, THINC Lab, Athens, GA 30622 USA
来源
关键词
Autonomous driving; decision making; freeway merging; reinforcement learning; linearly solvable MDP; REINFORCEMENT; SYSTEMS;
D O I
10.1109/TIV.2019.2904417
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Merging in congested freeway traffic is a significant challenge toward realizing fully automated (level 4) driving. Merging vehicles need to decide not only how to merge safely into a spot, but also where to merge. We present a method for freeway merge based on multipolicy decision making coupled with a reinforcement learning technique called passive actor-critic (pAC), which learns with less knowledge of the system and without active exploration. The multipolicy decision making selects a candidate spot formerging by using the state value learned by pAC. Together, these techniques yield a method that first decides where to merge and then realizes safe merging. We evaluate our method using real traffic data. Our experiments show that pAC achieves an overall success rate of 92% for merging into a predetermined spot on a freeway, which is comparable to human decision making.
引用
收藏
页码:287 / 297
页数:11
相关论文
共 50 条
  • [1] Merging with Extraction Method for Transfer Learning in Actor-Critic
    Takano, Toshiaki
    Takase, Haruhiko
    Kawanaka, Hiroharu
    Tsuruoka, Shinji
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2011, 15 (07) : 814 - 821
  • [2] A Discrete Soft Actor-Critic Decision-Making Strategy With Sample Filter for Freeway Autonomous Driving
    Guan, Jiayi
    Chen, Guang
    Huang, Jin
    Li, Zhijun
    Xiong, Lu
    Hou, Jing
    Knoll, Alois
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (02) : 2593 - 2598
  • [3] An actor-critic based learning method for decision-making and planning of autonomous vehicles
    XU Can
    ZHAO WanZhong
    CHEN QingYun
    WANG ChunYan
    [J]. Science China Technological Sciences, 2021, 64 (05) : 984 - 994
  • [4] An actor-critic based learning method for decision-making and planning of autonomous vehicles
    XU Can
    ZHAO WanZhong
    CHEN QingYun
    WANG ChunYan
    [J]. Science China(Technological Sciences), 2021, (05) - 994
  • [5] An actor-critic based learning method for decision-making and planning of autonomous vehicles
    Xu Can
    Zhao WanZhong
    Chen QingYun
    Wang ChunYan
    [J]. SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2021, 64 (05) : 984 - 994
  • [6] An actor-critic based learning method for decision-making and planning of autonomous vehicles
    Can Xu
    WanZhong Zhao
    QingYun Chen
    ChunYan Wang
    [J]. Science China Technological Sciences, 2021, 64 : 984 - 994
  • [7] Distributed Actor-Critic Learning Using Emphatic Weightings
    Stankovic, Milos S.
    Beko, Marko
    Stankovic, Srdjan S.
    [J]. 2022 8TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT'22), 2022, : 1167 - 1172
  • [8] Procurement auctions using actor-critic type learning algorithm
    Raju, CVL
    Narahari, Y
    Shah, S
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 4588 - 4594
  • [9] Enhancing Cooperation of Vehicle Merging Control in Heavy Traffic Using Communication-Based Soft Actor-Critic Algorithm
    Li, Meng
    Li, Zhibin
    Wang, Shunchao
    Zheng, Si
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (06) : 6491 - 6506
  • [10] Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning
    Leibfried, Felix
    Grau-Moya, Jordi
    [J]. CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100