Driving in Dense Traffic with Model-Free Reinforcement Learning

被引:55
|
作者
Saxena, Dhruv Mauria [1 ]
Bae, Sangjae [2 ]
Nakhaei, Alireza [3 ]
Fujimura, Kikuo [3 ]
Likhachev, Maxim [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
[3] Honda Res Inst USA Inc, San Jose, CA USA
关键词
D O I
10.1109/icra40945.2020.9197132
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditional planning and control methods could fail to find a feasible trajectory for an autonomous vehicle to execute amongst dense traffic on roads. This is because the obstacle-free volume in spacetime is very small in these scenarios for the vehicle to drive through. However, that does not mean the task is infeasible since human drivers are known to be able to drive amongst dense traffic by leveraging the cooperativeness of other drivers to open a gap. The traditional methods fail to take into account the fact that the actions taken by an agent affect the behaviour of other vehicles on the road. In this work, we rely on the ability of deep reinforcement learning to implicitly model such interactions and learn a continuous control policy over the action space of an autonomous vehicle. The application we consider requires our agent to negotiate and open a gap in the road in order to successfully merge or change lanes. Our policy learns to repeatedly probe into the target road lane while trying to find a safe spot to move in to. We compare against two model-predictive control-based algorithms and show that our policy outperforms them in simulation. As part of this work, we introduce a benchmark for driving in dense traffic for use by the community.
引用
收藏
页码:5385 / 5392
页数:8
相关论文
共 50 条
  • [41] Mastering the game of Stratego with model-free multiagent reinforcement learning
    Perolat, Julien
    De Vylder, Bart
    Hennes, Daniel
    Tarassov, Eugene
    Strub, Florian
    de Boer, Vincent
    Muller, Paul
    Connor, Jerome T.
    Burch, Neil
    Anthony, Thomas
    McAleer, Stephen
    Elie, Romuald
    Cen, Sarah H.
    Wang, Zhe
    Gruslys, Audrunas
    Malysheva, Aleksandra
    Khan, Mina
    Ozair, Sherjil
    Timbers, Finbarr
    Pohlen, Toby
    Eccles, Tom
    Rowland, Mark
    Lanctot, Marc
    Lespiau, Jean-Baptiste
    Piot, Bilal
    Omidshafiei, Shayegan
    Lockhart, Edward
    Sifre, Laurent
    Beauguerlange, Nathalie
    Munos, Remi
    Silver, David
    Singh, Satinder
    Hassabis, Demis
    Tuyls, Karl
    [J]. SCIENCE, 2022, 378 (6623) : 990 - +
  • [42] Model-free robust reinforcement learning via Polynomial Chaos
    Liu, Jianxiang
    Wu, Faguo
    Zhang, Xiao
    [J]. Knowledge-Based Systems, 2025, 309
  • [43] Omega-Regular Objectives in Model-Free Reinforcement Learning
    Hahn, Ernst Moritz
    Perez, Mateo
    Schewe, Sven
    Somenzi, Fabio
    Trivedi, Ashutosh
    Wojtczak, Dominik
    [J]. TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, PT I, 2019, 11427 : 395 - 412
  • [45] Model-free Control for Stratospheric Airship Based on Reinforcement Learning
    Nie, Chunyu
    Zhu, Ming
    Zheng, Zewei
    Wu, Zhe
    [J]. PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 10702 - 10707
  • [46] An Hybrid Model-Free Reinforcement Learning Approach for HVAC Control
    Solinas, Francesco M.
    Bellagarda, Andrea
    Macii, Enrico
    Patti, Edoardo
    Bottaccioli, Lorenzo
    [J]. 2021 21ST IEEE INTERNATIONAL CONFERENCE ON ENVIRONMENT AND ELECTRICAL ENGINEERING AND 2021 5TH IEEE INDUSTRIAL AND COMMERCIAL POWER SYSTEMS EUROPE (EEEIC/I&CPS EUROPE), 2021,
  • [47] Depth Control of Model-Free AUVs via Reinforcement Learning
    Wu, Hui
    Song, Shiji
    You, Keyou
    Wu, Cheng
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2019, 49 (12): : 2499 - 2510
  • [48] Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach
    Zhang, Xuezhou
    Song, Yuda
    Uehara, Masatoshi
    Wang, Mengdi
    Agarwal, Alekh
    Sun, Wen
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [49] Model-free Predictive Optimal Iterative Learning Control using Reinforcement Learning
    Zhang, Yueqing
    Chu, Bing
    Shu, Zhan
    [J]. 2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 3279 - 3284
  • [50] Model-Based and Model-Free Replay Mechanisms for Reinforcement Learning in Neurorobotics
    Massi, Elisa
    Barthelemy, Jeanne
    Mailly, Juliane
    Dromnelle, Remi
    Canitrot, Julien
    Poniatowski, Esther
    Girard, Benoit
    Khamassi, Mehdi
    [J]. FRONTIERS IN NEUROROBOTICS, 2022, 16