Self-adaptive MCTS for General Video Game Playing

被引:14
|
作者
Sironi, Chiara F. [1 ]
Liu, Jialin [2 ]
Perez-Liebana, Diego [2 ]
Gaina, Raluca D. [2 ]
Bravi, Ivan [2 ]
Lucas, Simon M. [2 ]
Winands, Mark H. M. [1 ]
机构
[1] Maastricht Univ, Dept Data Sci & Knowledge Engn, Games & AI Grp, Maastricht, Netherlands
[2] Queen Mary Univ London, Sch Elect Engn & Comp Sci, Game AI Grp, London, England
来源
APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2018 | 2018年 / 10784卷
基金
英国工程与自然科学研究理事会;
关键词
MCTS; On-line tuning; Self-adaptive Robust game playing; General video game playing; GO;
D O I
10.1007/978-3-319-77538-8_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monte-Carlo Tree Search (MCTS) has shown particular success in General Game Playing (GGP) and General Video Game Playing (GVGP) and many enhancements and variants have been developed. Recently, an on-line adaptive parameter tuning mechanism for MCTS agents has been proposed that almost achieves the same performance as off-line tuning in GGP. In this paper we apply the same approach to GVGP and use the popular General Video Game AI (GVGAI) framework, in which the time allowed to make a decision is only 40ms. We design three Self-Adaptive MCTS (SA-MCTS) agents that optimize on-line the parameters of a standard non-Self-Adaptive MCTS agent of GVGAI. The three agents select the parameter values using Naive Monte-Carlo, an Evolutionary Algorithm and an N-Tuple Bandit Evolutionary Algorithm respectively, and are tested on 20 single-player games of GVGAI. The SA-MCTS agents achieve more robust results on the tested games. With the same time setting, they perform similarly to the baseline standard MCTS agent in the games for which the baseline agent performs well, and significantly improve the win rate in the games for which the baseline agent performs poorly. As validation, we also test the performance of non-Self-Adaptive MCTS instances that use the most sampled parameter settings during the on-line tuning of each of the three SA-MCTS agents for each game. Results show that these parameter settings improve the win rate on the games Wait for Breakfast and Escape by 4 times and 150 times, respectively.
引用
收藏
页码:358 / 375
页数:18
相关论文
共 50 条
  • [21] A Model Free Approach To General Video Game Playing
    Mehdikhani, Mehdi
    Fahami, Mohammad Amin
    2017 IEEE 4TH INTERNATIONAL CONFERENCE ON KNOWLEDGE-BASED ENGINEERING AND INNOVATION (KBEI), 2017, : 925 - 930
  • [22] Ensemble Decision Systems for General Video Game Playing
    Anderson, Damien
    Rodgers, Philip
    Levine, John
    Guerrero-Romero, Cristina
    Perez-Liebana, Diego
    2019 IEEE CONFERENCE ON GAMES (COG), 2019,
  • [23] Informed Hybrid Game Tree Search for General Video Game Playing
    Joppen, Tobias
    Moneke, Miriam Ulrike
    Schroeder, Nils
    Wirth, Christian
    Fuernkranz, Johannes
    IEEE TRANSACTIONS ON GAMES, 2018, 10 (01) : 78 - 90
  • [24] Evolving Game-Specific UCB Alternatives for General Video Game Playing
    Bravi, Ivan
    Khalifa, Ahmed
    Holmgard, Christoffer
    Togelius, Julian
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2017, PT I, 2017, 10199 : 393 - 406
  • [25] Rolling Horizon Evolutionary Algorithms for General Video Game Playing
    Gaina, Raluca D.
    Devlin, Sam
    Lucas, Simon M.
    Perez-Liebana, Diego
    IEEE TRANSACTIONS ON GAMES, 2022, 14 (02) : 232 - 242
  • [26] Self-adaptive methods for general variational inequalities
    Noor, Muhammad Aslam
    Bnouhachem, Abdellah
    Ullah, Saleem
    NONLINEAR ANALYSIS-THEORY METHODS & APPLICATIONS, 2009, 71 (09) : 3728 - 3738
  • [27] Self-adaptive SURF for image-to-video matching
    Yang, Ming
    Li, Jiaming
    Li, Zhigang
    Li, Wen
    Zhang, Kairui
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (01) : 751 - 759
  • [28] Monte Carlo Tree Search with Options for General Video Game Playing
    de Waard, Maarten
    Roijers, Diederik M.
    Bakkes, Sander C. J.
    2016 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG), 2016,
  • [29] A self-adaptive subtraction algorithm for dynamic background video
    An, Zhiyong
    Zhang, JiaHui
    Chen, Shuying
    Ji, Hanran
    THIRD INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2018, 10828
  • [30] Self-adaptive Detection of Moving Vehicles in Traffic Video
    Zhai Hai-tao
    Wu Jian
    Xia Jie
    Cui Zhi-ming
    2009 INTERNATIONAL SYMPOSIUM ON WEB INFORMATION SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2009, : 449 - 453