KB-Tree: Learnable and Continuous Monte-Carlo Tree Search for Autonomous Driving Planning

被引:5
|
作者
Lei, Lanxin [1 ]
Luo, Ruiming [2 ]
Zheng, Renjie [1 ]
Wang, Jingke [1 ]
Zhang, JianWei [1 ]
Qiu, Cong [1 ]
Ma, Liulong [1 ]
Jin, Liyang [1 ]
Zhang, Ping [1 ]
Chen, Junbo [1 ]
机构
[1] Alibaba DAMO Acad, Dept Autonomous Driving Lab, Hangzhou, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
关键词
OPTIMIZATION; SPACES;
D O I
10.1109/IROS51168.2021.9636442
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a novel learnable and continuous Monte-Carlo Tree Search method, named as KB-Tree, for motion planning in autonomous driving. The proposed method utilizes an asymptotical PUCB based on Kernel Regression (KR-AUCB) as a novel UCB variant, to improve the exploitation and exploration performance. In addition, we further optimize the sampling in continuous space by adapting Bayesian Optimization (BO) in the selection process of MCTS. Moreover, we use a customized Graph Neural Network (GNN) as our feature extractor to improve the learning performance. To the best of our knowledge, we are the first to apply the continuous MCTS method in autonomous driving. To validate our method, we conduct extensive experiments under several weakly and strongly interactive scenarios. The results show that our proposed method performs well in all tasks, and outperforms the learning-based continuous MCTS method and the state-of-the-art Reinforcement Learning (RL) baseline.
引用
收藏
页码:4493 / 4500
页数:8
相关论文
共 50 条
  • [1] Tactical Cooperative Planning for Autonomous Highway Driving using Monte-Carlo Tree Search
    Lenz, David
    Kessler, Tobias
    Knoll, Alois
    2016 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2016, : 447 - 453
  • [2] Monte-Carlo Tree Search in Dragline Operation Planning
    Liu, Haoquan
    Austin, Kevin
    Forbes, Michael
    Kearney, Michael
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (01): : 419 - 425
  • [3] Monte-Carlo Tree Search with Tree Shape Control
    Marchenko, Oleksandr I.
    Marchenko, Oleksii O.
    2017 IEEE FIRST UKRAINE CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (UKRCON), 2017, : 812 - 817
  • [4] Monte-Carlo Tree Search for Logistics
    Edelkamp, Stefan
    Gath, Max
    Greulich, Christoph
    Humann, Malte
    Herzog, Otthein
    Lawo, Michael
    COMMERCIAL TRANSPORT, 2016, : 427 - 440
  • [5] Monte-Carlo Tree Search Solver
    Winands, Mark H. M.
    Bjornsson, Yngvi
    Saito, Jahn-Takeshi
    COMPUTERS AND GAMES, 2008, 5131 : 25 - +
  • [6] Parallel Monte-Carlo Tree Search
    Chaslot, Guillaume M. J. -B.
    Winands, Mark H. M.
    van den Herik, H. Jaap
    COMPUTERS AND GAMES, 2008, 5131 : 60 - +
  • [7] Multiple Tree for Partially Observable Monte-Carlo Tree Search
    Auger, David
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, PT I, 2011, 6624 : 53 - 62
  • [8] Monte-Carlo Tree Search for Efficient Visually Guided Rearrangement Planning
    Labbe, Yann
    Zagoruyko, Sergey
    Kalevatykh, Igor
    Laptev, Ivan
    Carpentier, Justin
    Aubry, Mathieu
    Sivic, Josef
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) : 3715 - 3722
  • [9] Nested Monte-Carlo Tree Search for Online Planning in Large MDPs
    Baier, Hendrik
    Winands, Mark H. M.
    20TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2012), 2012, 242 : 109 - 114
  • [10] Monte-Carlo Tree Search in Continuous Action Spaces with Value Gradients
    Lee, Jongmin
    Jeon, Wonseok
    Kim, Geon-Hyeong
    Kim, Kee-Eung
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 4561 - 4568