KB-Tree: Learnable and Continuous Monte-Carlo Tree Search for Autonomous Driving Planning

被引：5

作者：

Lei, Lanxin ^{[1
]}

Luo, Ruiming ^{[2
]}

Zheng, Renjie ^{[1
]}

Wang, Jingke ^{[1
]}

Zhang, JianWei ^{[1
]}

Qiu, Cong ^{[1
]}

Ma, Liulong ^{[1
]}

Jin, Liyang ^{[1
]}

Zhang, Ping ^{[1
]}

Chen, Junbo ^{[1
]}

机构：

[1] Alibaba DAMO Acad, Dept Autonomous Driving Lab, Hangzhou, Peoples R China

[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China

来源：

2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2021年

关键词：

OPTIMIZATION; SPACES;

D O I：

10.1109/IROS51168.2021.9636442

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we present a novel learnable and continuous Monte-Carlo Tree Search method, named as KB-Tree, for motion planning in autonomous driving. The proposed method utilizes an asymptotical PUCB based on Kernel Regression (KR-AUCB) as a novel UCB variant, to improve the exploitation and exploration performance. In addition, we further optimize the sampling in continuous space by adapting Bayesian Optimization (BO) in the selection process of MCTS. Moreover, we use a customized Graph Neural Network (GNN) as our feature extractor to improve the learning performance. To the best of our knowledge, we are the first to apply the continuous MCTS method in autonomous driving. To validate our method, we conduct extensive experiments under several weakly and strongly interactive scenarios. The results show that our proposed method performs well in all tasks, and outperforms the learning-based continuous MCTS method and the state-of-the-art Reinforcement Learning (RL) baseline.

引用

页码：4493 / 4500

页数：8

共 50 条

[21] Split Moves for Monte-Carlo Tree Search
Kowalski, Jakub
Mika, Maksymilian
Pawlik, Wojciech
Sutowicz, Jakub
Szykula, Marek
Winands, Mark H. M.
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 10247 - 10255
[22] Convex Regularization in Monte-Carlo Tree Search
Dam, Tuan
D'Eramo, Carlo
Peters, Jan
Pajarinen, Joni
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[23] The Multiple Uses of Monte-Carlo Tree Search
Senington, Richard
SPS 2022, 2022, 21 : 713 - 724
[24] Driving Maneuvers Prediction Based Autonomous Driving Control by Deep Monte Carlo Tree Search
Chen, Jienan
Zhang, Cong
Luo, Jinting
Xie, Junfei
Wan, Yan
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (07) : 7146 - 7158
[25] Satellite Autonomous Mission Planning Based on Improved Monte Carlo Tree Search
Li, Zichao
Li, You
Luo, Rongzheng
SYMMETRY-BASEL, 2024, 16 (08):
[26] Epistemic Multi-agent Planning Using Monte-Carlo Tree Search
Reifsteck, Daniel
Engesser, Thorsten
Mattmueller, Robert
Nebel, Bernhard
ADVANCES IN ARTIFICIAL INTELLIGENCE, KI 2019, 2019, 11793 : 277 - 289
[27] Automated Machine Learning with Monte-Carlo Tree Search
Rakotoarison, Herilalaina
Schoenauer, Marc
Sebag, Michele
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3296 - 3303
[28] Generalized Mean Estimation in Monte-Carlo Tree Search
Dam, Tuan
Klink, Pascal
D'Eramo, Carlo
Peters, Jan
Pajarinen, Joni
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2397 - 2404
[29] Monte-Carlo Tree Search by Best Arm Identification
Kaufmann, Emilie
Koolen, Wouter M.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[30] Monte-Carlo Tree Search for Scalable Coalition Formation
Wu, Feng
Ramchurn, Sarvapali D.
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 407 - 413

← 1 2 3 4 5 →