Rethinking Closed-Loop Training for Autonomous Driving

被引:1
|
作者
Zhang, Chris [1 ,2 ]
Guo, Runsheng [3 ]
Zeng, Wenyuan [1 ,2 ]
Xiong, Yuwen [1 ,2 ]
Dai, Binbin [1 ]
Hu, Rui [1 ]
Ren, Mengye [4 ]
Urtasun, Raquel [1 ,2 ]
机构
[1] Waabi, Toronto, ON, Canada
[2] Univ Toronto, Toronto, ON, Canada
[3] Univ Waterloo, Waterloo, ON, Canada
[4] NYU, New York, NY USA
来源
关键词
Closed-loop learning; Autonomous driving; RL; SHOGI; CHESS; GO;
D O I
10.1007/978-3-031-19842-7_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in high-fidelity simulators [22,44,82] have enabled closed-loop training of autonomous driving agents, potentially solving the distribution shift in training v.s. deployment and allowing training to be scaled both safely and cheaply. However, there is a lack of understanding of how to build effective training benchmarks for closedloop training. In this work, we present the first empirical study which analyzes the effects of different training benchmark designs on the success of learning agents, such as how to design traffic scenarios and scale training environments. Furthermore, we show that many popular RL algorithms cannot achieve satisfactory performance in the context of autonomous driving, as they lack long-term planning and take an extremely long time to train. To address these issues, we propose trajectory value learning (TRAVL), an RL-based driving agent that performs planning with multistep look-ahead and exploits cheaply generated imagined data for efficient learning. Our experiments show that TRAVL can learn much faster and produce safer maneuvers compared to all the baselines.
引用
收藏
页码:264 / 282
页数:19
相关论文
共 50 条
  • [1] MCMSys: Multimodal Data Closed-Loop Management System for Autonomous Driving
    Li, He
    Zhou, Zhaogao
    Chen, Pin-tong
    Yan, Jinjie
    Yu, Rong
    Hu, Ziwei
    [J]. 2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS, 2024, : 411 - 417
  • [2] A Probabilistic Approach to Mixed Open-loop and Closed-loop Control, with Application to Extreme Autonomous Driving
    Kolter, J. Zico
    Plagemann, Christian
    Jackson, David T.
    Ng, Andrew Y.
    Thrun, Sebastian
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, : 839 - 845
  • [3] A Survey on Self-Evolving Autonomous Driving: A Perspective on Data Closed-Loop Technology
    Li, Xincheng
    Wang, Zhaoyi
    Huang, Yanjun
    Chen, Hong
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (11): : 4613 - 4631
  • [4] CAT: Closed-loop Adversarial Training for Safe End-to-End Driving
    Zhang, Linrui
    Peng, Zhenghao
    Li, Quanyi
    Zhou, Bolei
    [J]. CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [5] Closed-loop Approach to Perception in Autonomous System
    Samal, Kruttidipta
    Wolf, Marilyn
    Mukhopadhyay, Saibal
    [J]. PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), 2021, : 463 - 468
  • [6] Closed-Loop Training for Projected GAN
    Zhao, Jiangwei
    Zhang, Liang
    Pan, Lili
    Li, Hongliang
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 106 - 110
  • [7] Rethinking Pan-Sharpening in Closed-Loop Regularization
    Zhou, Man
    Huang, Jie
    Hong, Danfeng
    Zhao, Feng
    Li, Chongyi
    Chanussot, Jocelyn
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 15
  • [8] The Three Laws of Autonomous and Closed-Loop Systems in Anesthesia
    Kuck, Kai
    Johnson, Ken B.
    [J]. ANESTHESIA AND ANALGESIA, 2017, 124 (02): : 377 - 380
  • [9] Autonomous Multisensor Calibration and Closed-loop Fusion for SLAM
    Jacobson, Adam
    Chen, Zetao
    Milford, Michael
    [J]. JOURNAL OF FIELD ROBOTICS, 2015, 32 (01) : 85 - 122
  • [10] Implantable and biodegradable closed-loop devices for autonomous electrotherapy
    Zhang, Xiaoying
    Mehvish, Darakhshan
    Yang, Hui
    [J]. SMARTMAT, 2023, 4 (03):