KB-Tree: Learnable and Continuous Monte-Carlo Tree Search for Autonomous Driving Planning

Cited by: 5
Authors
Lei, Lanxin [1]
Luo, Ruiming [2]
Zheng, Renjie [1]
Wang, Jingke [1]
Zhang, JianWei [1]
Qiu, Cong [1]
Ma, Liulong [1]
Jin, Liyang [1]
Zhang, Ping [1]
Chen, Junbo [1]
Affiliations
[1] Alibaba DAMO Acad, Dept Autonomous Driving Lab, Hangzhou, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
Keywords
OPTIMIZATION; SPACES
DOI
10.1109/IROS51168.2021.9636442
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper, we present a novel learnable and continuous Monte-Carlo Tree Search method, named KB-Tree, for motion planning in autonomous driving. The proposed method uses an asymptotical PUCB based on Kernel Regression (KR-AUCB) as a novel UCB variant to improve exploitation and exploration performance. In addition, we optimize sampling in continuous space by adapting Bayesian Optimization (BO) to the selection process of MCTS. Moreover, we use a customized Graph Neural Network (GNN) as our feature extractor to improve learning performance. To the best of our knowledge, we are the first to apply a continuous MCTS method to autonomous driving. To validate our method, we conduct extensive experiments in several weakly and strongly interactive scenarios. The results show that the proposed method performs well in all tasks and outperforms both the learning-based continuous MCTS method and the state-of-the-art Reinforcement Learning (RL) baseline.
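As a rough illustration of the kernel-regression idea named in the abstract, the sketch below scores already-sampled continuous actions with a generic kernel-regression UCB: each action's value is a kernel-weighted average of nearby rewards, and its exploration bonus shrinks as the kernel-weighted visit density around it grows. This is not the paper's KR-AUCB, whose asymptotic PUCB term and learned components are not described in this record. The function names, the Gaussian kernel, the 1-D action space, the bandwidth, and the constant `c` are all assumptions made for illustration.

```python
import math


def gaussian_kernel(a, b, bandwidth=0.5):
    """Similarity between two 1-D actions (assumed Gaussian kernel)."""
    return math.exp(-((a - b) ** 2) / (2.0 * bandwidth ** 2))


def kr_ucb_scores(actions, rewards, c=1.0, bandwidth=0.5):
    """Return one generic kernel-regression UCB score per sampled action.

    actions: 1-D continuous actions tried so far (hypothetical setup)
    rewards: the return observed for each of those actions
    """
    values, densities = [], []
    for a in actions:
        weights = [gaussian_kernel(a, b, bandwidth) for b in actions]
        density = sum(weights)  # kernel-weighted visit count, always >= 1
        value = sum(w * r for w, r in zip(weights, rewards)) / density
        values.append(value)
        densities.append(density)
    total = sum(densities)
    # Exploitation term plus a bonus that is larger in sparsely sampled regions.
    return [v + c * math.sqrt(math.log(total) / d)
            for v, d in zip(values, densities)]


def select_action(actions, rewards, c=1.0, bandwidth=0.5):
    """Index of the sampled action with the highest score."""
    scores = kr_ucb_scores(actions, rewards, c, bandwidth)
    return max(range(len(actions)), key=scores.__getitem__)
```

With equal rewards, the score favors the action in the least-sampled region; with equal sampling density, it favors the action with the higher estimated value. Proposing new actions in continuous space (for which the abstract uses Bayesian Optimization) is outside the scope of this sketch.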
Pages: 4493-4500
Number of pages: 8
Related papers
50 in total
  • [21] Split Moves for Monte-Carlo Tree Search
    Kowalski, Jakub
    Mika, Maksymilian
    Pawlik, Wojciech
    Sutowicz, Jakub
    Szykula, Marek
    Winands, Mark H. M.
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 10247 - 10255
  • [22] Convex Regularization in Monte-Carlo Tree Search
    Dam, Tuan
    D'Eramo, Carlo
    Peters, Jan
    Pajarinen, Joni
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [23] The Multiple Uses of Monte-Carlo Tree Search
    Senington, Richard
    SPS 2022, 2022, 21 : 713 - 724
  • [24] Driving Maneuvers Prediction Based Autonomous Driving Control by Deep Monte Carlo Tree Search
    Chen, Jienan
    Zhang, Cong
    Luo, Jinting
    Xie, Junfei
    Wan, Yan
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (07) : 7146 - 7158
  • [25] Satellite Autonomous Mission Planning Based on Improved Monte Carlo Tree Search
    Li, Zichao
    Li, You
    Luo, Rongzheng
    SYMMETRY-BASEL, 2024, 16 (08):
  • [26] Epistemic Multi-agent Planning Using Monte-Carlo Tree Search
    Reifsteck, Daniel
    Engesser, Thorsten
    Mattmueller, Robert
    Nebel, Bernhard
    ADVANCES IN ARTIFICIAL INTELLIGENCE, KI 2019, 2019, 11793 : 277 - 289
  • [27] Automated Machine Learning with Monte-Carlo Tree Search
    Rakotoarison, Herilalaina
    Schoenauer, Marc
    Sebag, Michele
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3296 - 3303
  • [28] Generalized Mean Estimation in Monte-Carlo Tree Search
    Dam, Tuan
    Klink, Pascal
    D'Eramo, Carlo
    Peters, Jan
    Pajarinen, Joni
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2397 - 2404
  • [29] Monte-Carlo Tree Search by Best Arm Identification
    Kaufmann, Emilie
    Koolen, Wouter M.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [30] Monte-Carlo Tree Search for Scalable Coalition Formation
    Wu, Feng
    Ramchurn, Sarvapali D.
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 407 - 413