Simulated Annealing Monte Carlo Tree Search for large POMDPs

被引:2
|
作者
Xiong, Kai [1 ]
Jiang, Hong [1 ]
机构
[1] Southwest Univ Sci & Technol, Mianyang 621010, Si Chuan, Peoples R China
关键词
Simulated Annealing; MCTS; POMDPs;
D O I
10.1109/IHMSC.2014.42
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many planning and control problems can be modeled as large POMDPs, but very few can be solved scalably because of their computational complexity. This paper proposes a Simulated Annealing based on the Monte Carlo Tree Search for large POMDPs. The proposed algorithm determines an acceptance probability of sampling a back-propagation's outcome in the simulated annealing process. The experiments show that the proposed SAMCTS (Simulated Annealing Monte Carlo Tree Search) outperforms the original Simulated Annealing algorithm when applied to a large POMDP benchmark problem.
引用
收藏
页码:140 / 143
页数:4
相关论文
共 50 条
  • [1] Learning in POMDPs with Monte Carlo Tree Search
    Katt, Sammie
    Oliehoek, Frans A.
    Amato, Christopher
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [2] Monte-Carlo Tree Search for Constrained POMDPs
    Lee, Jongmin
    Kim, Geon-Hyeong
    Poupart, Pascal
    Kim, Kee-Eung
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [3] Online Planning for Interactive-POMDPs using Nested Monte Carlo Tree Search
    Schwartz, Jonathon
    Zhou, Ruijia
    Kurniawati, Hanna
    [J]. 2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 8770 - 8777
  • [4] Monte Carlo POMDPs
    Thrun, S
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 1064 - 1070
  • [5] Monte-Carlo Search for an Equilibrium in Dec-POMDPs
    You, Yang
    Thomas, Vincent
    Colas, Francis
    Buffet, Olivier
    [J]. UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 2444 - 2453
  • [6] Sequential Monte Carlo simulated annealing
    Zhou, Enlu
    Chen, Xi
    [J]. JOURNAL OF GLOBAL OPTIMIZATION, 2013, 55 (01) : 101 - 124
  • [7] Sequential Monte Carlo simulated annealing
    Enlu Zhou
    Xi Chen
    [J]. Journal of Global Optimization, 2013, 55 : 101 - 124
  • [8] Simulated Annealing Using Hybrid Monte Carlo
    R. Salazar
    R. Toral
    [J]. Journal of Statistical Physics, 1997, 89 : 1047 - 1060
  • [9] Simulated annealing using hybrid Monte Carlo
    Salazar, R
    Toral, R
    [J]. JOURNAL OF STATISTICAL PHYSICS, 1997, 89 (5-6) : 1047 - 1060
  • [10] Large Scale Hard Sample Mining with Monte Carlo Tree Search
    Canevet, Olivier
    Fleuret, Francois
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 5128 - 5137