KB-Tree: Learnable and Continuous Monte-Carlo Tree Search for Autonomous Driving Planning

Cited by: 5

Authors:

Lei, Lanxin [1]

Luo, Ruiming [2]

Zheng, Renjie [1]

Wang, Jingke [1]

Zhang, JianWei [1]

Qiu, Cong [1]

Ma, Liulong [1]

Jin, Liyang [1]

Zhang, Ping [1]

Chen, Junbo [1]

Affiliations:

[1] Alibaba DAMO Acad, Dept Autonomous Driving Lab, Hangzhou, Peoples R China

[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China

Source:

2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2021

Keywords:

OPTIMIZATION; SPACES;

DOI:

10.1109/IROS51168.2021.9636442

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

In this paper, we present a novel learnable and continuous Monte-Carlo Tree Search method, named KB-Tree, for motion planning in autonomous driving. The proposed method utilizes an asymptotical PUCB based on Kernel Regression (KR-AUCB) as a novel UCB variant, to improve the exploitation and exploration performance. In addition, we further optimize the sampling in continuous space by adapting Bayesian Optimization (BO) in the selection process of MCTS. Moreover, we use a customized Graph Neural Network (GNN) as our feature extractor to improve the learning performance. To the best of our knowledge, we are the first to apply the continuous MCTS method in autonomous driving. To validate our method, we conduct extensive experiments under several weakly and strongly interactive scenarios. The results show that our proposed method performs well in all tasks, and outperforms the learning-based continuous MCTS method and the state-of-the-art Reinforcement Learning (RL) baseline.
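
The abstract does not give the KR-AUCB formula, so the following is only a minimal sketch of the general idea it builds on: a UCB score in which visit counts and value estimates are shared between nearby continuous actions through kernel regression. The Gaussian kernel, the one-dimensional action, the bandwidth, the exploration constant `c`, and all function names are illustrative assumptions, not the paper's definitions.

```python
import math

def gaussian_kernel(a, b, bandwidth=0.5):
    """Similarity between two scalar actions (assumed Gaussian kernel)."""
    return math.exp(-((a - b) ** 2) / (2.0 * bandwidth ** 2))

def kr_ucb_scores(actions, visits, mean_values, c=1.0, bandwidth=0.5):
    """Generic kernel-regression UCB score for each sampled action.

    actions:     scalar actions already sampled at a tree node
    visits:      visit count of each action (each must be >= 1)
    mean_values: average return observed for each action
    """
    smoothed_values = []
    effective_visits = []
    for a in actions:
        # Kernel-weighted visit mass contributed by every sampled action.
        k = [gaussian_kernel(a, b, bandwidth) * n for b, n in zip(actions, visits)]
        w = sum(k)
        # Kernel-regression estimate of the value at action a.
        v = sum(ki * vb for ki, vb in zip(k, mean_values)) / w
        effective_visits.append(w)
        smoothed_values.append(v)
    total = sum(effective_visits)
    # Exploitation term plus an exploration bonus that shrinks as the
    # neighbourhood of an action accumulates visits.
    return [
        v + c * math.sqrt(math.log(total) / w)
        for v, w in zip(smoothed_values, effective_visits)
    ]

def select_action(actions, visits, mean_values, c=1.0, bandwidth=0.5):
    """Return the sampled action with the highest kernel-regression UCB score."""
    scores = kr_ucb_scores(actions, visits, mean_values, c, bandwidth)
    best = max(range(len(actions)), key=scores.__getitem__)
    return actions[best]
```

With equal value estimates, the score favours an action far from heavily visited ones; with `c=0` it reduces to picking the best smoothed value. How KB-Tree makes this score asymptotic, combines it with a learned prior, or uses Bayesian Optimization to propose new actions is not recoverable from the abstract and is not modelled here.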

Pages: 4493-4500

Number of pages: 8
