Rethinking Closed-Loop Training for Autonomous Driving

被引：1

作者：

Zhang, Chris ^{[1
,2
]}

Guo, Runsheng ^{[3
]}

Zeng, Wenyuan ^{[1
,2
]}

Xiong, Yuwen ^{[1
,2
]}

Dai, Binbin ^{[1
]}

Hu, Rui ^{[1
]}

Ren, Mengye ^{[4
]}

Urtasun, Raquel ^{[1
,2
]}

机构：

[1] Waabi, Toronto, ON, Canada

[2] Univ Toronto, Toronto, ON, Canada

[3] Univ Waterloo, Waterloo, ON, Canada

[4] NYU, New York, NY USA

来源：

COMPUTER VISION, ECCV 2022, PT XXXIX | 2022年 / 13699卷

关键词：

Closed-loop learning; Autonomous driving; RL; SHOGI; CHESS; GO;

D O I：

10.1007/978-3-031-19842-7_16

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent advances in high-fidelity simulators [22,44,82] have enabled closed-loop training of autonomous driving agents, potentially solving the distribution shift in training v.s. deployment and allowing training to be scaled both safely and cheaply. However, there is a lack of understanding of how to build effective training benchmarks for closedloop training. In this work, we present the first empirical study which analyzes the effects of different training benchmark designs on the success of learning agents, such as how to design traffic scenarios and scale training environments. Furthermore, we show that many popular RL algorithms cannot achieve satisfactory performance in the context of autonomous driving, as they lack long-term planning and take an extremely long time to train. To address these issues, we propose trajectory value learning (TRAVL), an RL-based driving agent that performs planning with multistep look-ahead and exploits cheaply generated imagined data for efficient learning. Our experiments show that TRAVL can learn much faster and produce safer maneuvers compared to all the baselines.

引用

页码：264 / 282

页数：19

共 50 条

[1] MCMSys: Multimodal Data Closed-Loop Management System for Autonomous Driving
Li, He
Zhou, Zhaogao
Chen, Pin-tong
Yan, Jinjie
Yu, Rong
Hu, Ziwei
[J]. 2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS, 2024, : 411 - 417
[2] A Probabilistic Approach to Mixed Open-loop and Closed-loop Control, with Application to Extreme Autonomous Driving
Kolter, J. Zico
Plagemann, Christian
Jackson, David T.
Ng, Andrew Y.
Thrun, Sebastian
[J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, : 839 - 845
[3] A Survey on Self-Evolving Autonomous Driving: A Perspective on Data Closed-Loop Technology
Li, Xincheng
Wang, Zhaoyi
Huang, Yanjun
Chen, Hong
[J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (11): : 4613 - 4631
[4] CAT: Closed-loop Adversarial Training for Safe End-to-End Driving
Zhang, Linrui
Peng, Zhenghao
Li, Quanyi
Zhou, Bolei
[J]. CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
[5] Closed-loop Approach to Perception in Autonomous System
Samal, Kruttidipta
Wolf, Marilyn
Mukhopadhyay, Saibal
[J]. PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), 2021, : 463 - 468
[6] Closed-Loop Training for Projected GAN
Zhao, Jiangwei
Zhang, Liang
Pan, Lili
Li, Hongliang
[J]. IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 106 - 110
[7] Rethinking Pan-Sharpening in Closed-Loop Regularization
Zhou, Man
Huang, Jie
Hong, Danfeng
Zhao, Feng
Li, Chongyi
Chanussot, Jocelyn
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 15
[8] The Three Laws of Autonomous and Closed-Loop Systems in Anesthesia
Kuck, Kai
Johnson, Ken B.
[J]. ANESTHESIA AND ANALGESIA, 2017, 124 (02): : 377 - 380
[9] Autonomous Multisensor Calibration and Closed-loop Fusion for SLAM
Jacobson, Adam
Chen, Zetao
Milford, Michael
[J]. JOURNAL OF FIELD ROBOTICS, 2015, 32 (01) : 85 - 122
[10] Implantable and biodegradable closed-loop devices for autonomous electrotherapy
Zhang, Xiaoying
Mehvish, Darakhshan
Yang, Hui
[J]. SMARTMAT, 2023, 4 (03):

← 1 2 3 4 5 →