Merging in Congested Freeway Traffic Using Multipolicy Decision Making and Passive Actor-Critic Learning

Cited by: 43

Authors:

Nishi, Tomoki ^{[1,2]}

Doshi, Prashant ^{[3]}

Prokhorov, Danil ^{[1]}

Affiliations:

[1] Toyota R&D, Ann Arbor, MI 48105 USA

[2] Toyota Cent Res & Dev Labs Inc, Nagakute, Aichi 4801192, Japan

[3] Univ Georgia, Dept Comp Sci, THINC Lab, Athens, GA 30622 USA

Source:

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2019 / Vol. 4 / Issue 02

Keywords:

Autonomous driving; decision making; freeway merging; reinforcement learning; linearly solvable MDP; REINFORCEMENT; SYSTEMS;

DOI:

10.1109/TIV.2019.2904417

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Merging in congested freeway traffic is a significant challenge toward realizing fully automated (level 4) driving. Merging vehicles need to decide not only how to merge safely into a spot, but also where to merge. We present a method for freeway merging based on multipolicy decision making coupled with a reinforcement learning technique called passive actor-critic (pAC), which learns with less knowledge of the system and without active exploration. The multipolicy decision making selects a candidate spot for merging by using the state value learned by pAC. Together, these techniques yield a method that first decides where to merge and then realizes safe merging. We evaluate our method using real traffic data. Our experiments show that pAC achieves an overall success rate of 92% for merging into a predetermined spot on a freeway, which is comparable to human decision making.

Pages: 287-297

Page count: 11

50 records in total

[1] Merging with Extraction Method for Transfer Learning in Actor-Critic
Takano, Toshiaki
Takase, Haruhiko
Kawanaka, Hiroharu
Tsuruoka, Shinji
[J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2011, 15 (07) : 814 - 821
[2] A Discrete Soft Actor-Critic Decision-Making Strategy With Sample Filter for Freeway Autonomous Driving
Guan, Jiayi
Chen, Guang
Huang, Jin
Li, Zhijun
Xiong, Lu
Hou, Jing
Knoll, Alois
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (02) : 2593 - 2598
[3] An actor-critic based learning method for decision-making and planning of autonomous vehicles
XU Can
ZHAO WanZhong
CHEN QingYun
WANG ChunYan
[J]. Science China Technological Sciences, 2021, 64 (05) : 984 - 994
[4] An actor-critic based learning method for decision-making and planning of autonomous vehicles
XU Can
ZHAO WanZhong
CHEN QingYun
WANG ChunYan
[J]. Science China(Technological Sciences), 2021, (05) - 994
[5] An actor-critic based learning method for decision-making and planning of autonomous vehicles
Xu Can
Zhao WanZhong
Chen QingYun
Wang ChunYan
[J]. SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2021, 64 (05) : 984 - 994
[6] An actor-critic based learning method for decision-making and planning of autonomous vehicles
Can Xu
WanZhong Zhao
QingYun Chen
ChunYan Wang
[J]. Science China Technological Sciences, 2021, 64 : 984 - 994
[7] Distributed Actor-Critic Learning Using Emphatic Weightings
Stankovic, Milos S.
Beko, Marko
Stankovic, Srdjan S.
[J]. 2022 8TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT'22), 2022, : 1167 - 1172
[8] Procurement auctions using actor-critic type learning algorithm
Raju, CVL
Narahari, Y
Shah, S
[J]. 2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 4588 - 4594
[9] Enhancing Cooperation of Vehicle Merging Control in Heavy Traffic Using Communication-Based Soft Actor-Critic Algorithm
Li, Meng
Li, Zhibin
Wang, Shunchao
Zheng, Si
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (06) : 6491 - 6506
[10] Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning
Leibfried, Felix
Grau-Moya, Jordi
[J]. CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100