KB-Tree: Learnable and Continuous Monte-Carlo Tree Search for Autonomous Driving Planning

被引：5

作者：

Lei, Lanxin ^{[1
]}

Luo, Ruiming ^{[2
]}

Zheng, Renjie ^{[1
]}

Wang, Jingke ^{[1
]}

Zhang, JianWei ^{[1
]}

Qiu, Cong ^{[1
]}

Ma, Liulong ^{[1
]}

Jin, Liyang ^{[1
]}

Zhang, Ping ^{[1
]}

Chen, Junbo ^{[1
]}

机构：

[1] Alibaba DAMO Acad, Dept Autonomous Driving Lab, Hangzhou, Peoples R China

[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China

来源：

2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2021年

关键词：

OPTIMIZATION; SPACES;

D O I：

10.1109/IROS51168.2021.9636442

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we present a novel learnable and continuous Monte-Carlo Tree Search method, named as KB-Tree, for motion planning in autonomous driving. The proposed method utilizes an asymptotical PUCB based on Kernel Regression (KR-AUCB) as a novel UCB variant, to improve the exploitation and exploration performance. In addition, we further optimize the sampling in continuous space by adapting Bayesian Optimization (BO) in the selection process of MCTS. Moreover, we use a customized Graph Neural Network (GNN) as our feature extractor to improve the learning performance. To the best of our knowledge, we are the first to apply the continuous MCTS method in autonomous driving. To validate our method, we conduct extensive experiments under several weakly and strongly interactive scenarios. The results show that our proposed method performs well in all tasks, and outperforms the learning-based continuous MCTS method and the state-of-the-art Reinforcement Learning (RL) baseline.

引用

页码：4493 / 4500

页数：8

共 50 条

[31] EXPERIMENTS WITH MONTE-CARLO TREE SEARCH IN THE GAME OF HAVANNAH
Lorentz, Richard J.
ICGA JOURNAL, 2011, 34 (03) : 140 - 149
[32] Monte-Carlo tree search as regularized policy optimization
Grill, Jean-Bastien
Altche, Florent
Tang, Yunhao
Hubert, Thomas
Valko, Michal
Antonoglou, Ioannis
Munos, Remi
25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
[33] Converging to a Player Model In Monte-Carlo Tree Search
Sarratt, Trevor
Pynadath, David V.
Jhala, Arnav
2014 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG), 2014,
[34] AIs for Dominion Using Monte-Carlo Tree Search
Tollisen, Robin
Jansen, Jon Vegard
Goodwin, Morten
Glimsdal, Sondre
CURRENT APPROACHES IN APPLIED ARTIFICIAL INTELLIGENCE, 2015, 9101 : 43 - 52
[35] Parallel Monte-Carlo Tree Search with Simulation Servers
Kato, Hideki
Takeuchi, Ikuo
INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2010), 2010, : 491 - 498
[36] A SHOGI PROGRAM BASED ON MONTE-CARLO TREE SEARCH
Sato, Yoshikuni
Takahashi, Daisuke
Grimbergen, Reijer
ICGA JOURNAL, 2010, 33 (02) : 80 - 92
[37] CROSS-ENTROPY FOR MONTE-CARLO TREE SEARCH
Chaslot, Guillaume M. J. B.
Winands, Mark H. M.
Szita, Istvan
van den Herik, H. Jaap
ICGA JOURNAL, 2008, 31 (03) : 145 - 156
[38] Monte-Carlo Tree Search Parallelisation for Computer Go
van Niekerk, Francois
Kroon, Steve
van Rooyen, Gert-Jan
Inggs, Cornelia P.
PROCEEDINGS OF THE SOUTH AFRICAN INSTITUTE FOR COMPUTER SCIENTISTS AND INFORMATION TECHNOLOGISTS CONFERENCE, 2012, : 129 - 138
[39] Can Monte-Carlo Tree Search learn to sacrifice?
Nathan Companez
Aldeida Aleti
Journal of Heuristics, 2016, 22 : 783 - 813
[40] Monte-Carlo Tree Search for the Maximum Satisfiability Problem
Goffinet, Jack
Ramanujan, Raghuram
PRINCIPLES AND PRACTICE OF CONSTRAINT PROGRAMMING, CP 2016, 2016, 9892 : 251 - 267

← 1 2 3 4 5 →