KB-Tree: Learnable and Continuous Monte-Carlo Tree Search for Autonomous Driving Planning

被引:5
|
作者
Lei, Lanxin [1 ]
Luo, Ruiming [2 ]
Zheng, Renjie [1 ]
Wang, Jingke [1 ]
Zhang, JianWei [1 ]
Qiu, Cong [1 ]
Ma, Liulong [1 ]
Jin, Liyang [1 ]
Zhang, Ping [1 ]
Chen, Junbo [1 ]
机构
[1] Alibaba DAMO Acad, Dept Autonomous Driving Lab, Hangzhou, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
关键词
OPTIMIZATION; SPACES;
D O I
10.1109/IROS51168.2021.9636442
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a novel learnable and continuous Monte-Carlo Tree Search method, named as KB-Tree, for motion planning in autonomous driving. The proposed method utilizes an asymptotical PUCB based on Kernel Regression (KR-AUCB) as a novel UCB variant, to improve the exploitation and exploration performance. In addition, we further optimize the sampling in continuous space by adapting Bayesian Optimization (BO) in the selection process of MCTS. Moreover, we use a customized Graph Neural Network (GNN) as our feature extractor to improve the learning performance. To the best of our knowledge, we are the first to apply the continuous MCTS method in autonomous driving. To validate our method, we conduct extensive experiments under several weakly and strongly interactive scenarios. The results show that our proposed method performs well in all tasks, and outperforms the learning-based continuous MCTS method and the state-of-the-art Reinforcement Learning (RL) baseline.
引用
收藏
页码:4493 / 4500
页数:8
相关论文
共 50 条
  • [31] EXPERIMENTS WITH MONTE-CARLO TREE SEARCH IN THE GAME OF HAVANNAH
    Lorentz, Richard J.
    ICGA JOURNAL, 2011, 34 (03) : 140 - 149
  • [32] Monte-Carlo tree search as regularized policy optimization
    Grill, Jean-Bastien
    Altche, Florent
    Tang, Yunhao
    Hubert, Thomas
    Valko, Michal
    Antonoglou, Ioannis
    Munos, Remi
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [33] Converging to a Player Model In Monte-Carlo Tree Search
    Sarratt, Trevor
    Pynadath, David V.
    Jhala, Arnav
    2014 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG), 2014,
  • [34] AIs for Dominion Using Monte-Carlo Tree Search
    Tollisen, Robin
    Jansen, Jon Vegard
    Goodwin, Morten
    Glimsdal, Sondre
    CURRENT APPROACHES IN APPLIED ARTIFICIAL INTELLIGENCE, 2015, 9101 : 43 - 52
  • [35] Parallel Monte-Carlo Tree Search with Simulation Servers
    Kato, Hideki
    Takeuchi, Ikuo
    INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2010), 2010, : 491 - 498
  • [36] A SHOGI PROGRAM BASED ON MONTE-CARLO TREE SEARCH
    Sato, Yoshikuni
    Takahashi, Daisuke
    Grimbergen, Reijer
    ICGA JOURNAL, 2010, 33 (02) : 80 - 92
  • [37] CROSS-ENTROPY FOR MONTE-CARLO TREE SEARCH
    Chaslot, Guillaume M. J. B.
    Winands, Mark H. M.
    Szita, Istvan
    van den Herik, H. Jaap
    ICGA JOURNAL, 2008, 31 (03) : 145 - 156
  • [38] Monte-Carlo Tree Search Parallelisation for Computer Go
    van Niekerk, Francois
    Kroon, Steve
    van Rooyen, Gert-Jan
    Inggs, Cornelia P.
    PROCEEDINGS OF THE SOUTH AFRICAN INSTITUTE FOR COMPUTER SCIENTISTS AND INFORMATION TECHNOLOGISTS CONFERENCE, 2012, : 129 - 138
  • [39] Can Monte-Carlo Tree Search learn to sacrifice?
    Nathan Companez
    Aldeida Aleti
    Journal of Heuristics, 2016, 22 : 783 - 813
  • [40] Monte-Carlo Tree Search for the Maximum Satisfiability Problem
    Goffinet, Jack
    Ramanujan, Raghuram
    PRINCIPLES AND PRACTICE OF CONSTRAINT PROGRAMMING, CP 2016, 2016, 9892 : 251 - 267