Self-adaptive MCTS for General Video Game Playing

被引:14
|
作者
Sironi, Chiara F. [1 ]
Liu, Jialin [2 ]
Perez-Liebana, Diego [2 ]
Gaina, Raluca D. [2 ]
Bravi, Ivan [2 ]
Lucas, Simon M. [2 ]
Winands, Mark H. M. [1 ]
机构
[1] Maastricht Univ, Dept Data Sci & Knowledge Engn, Games & AI Grp, Maastricht, Netherlands
[2] Queen Mary Univ London, Sch Elect Engn & Comp Sci, Game AI Grp, London, England
来源
APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2018 | 2018年 / 10784卷
基金
英国工程与自然科学研究理事会;
关键词
MCTS; On-line tuning; Self-adaptive Robust game playing; General video game playing; GO;
D O I
10.1007/978-3-319-77538-8_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monte-Carlo Tree Search (MCTS) has shown particular success in General Game Playing (GGP) and General Video Game Playing (GVGP) and many enhancements and variants have been developed. Recently, an on-line adaptive parameter tuning mechanism for MCTS agents has been proposed that almost achieves the same performance as off-line tuning in GGP. In this paper we apply the same approach to GVGP and use the popular General Video Game AI (GVGAI) framework, in which the time allowed to make a decision is only 40ms. We design three Self-Adaptive MCTS (SA-MCTS) agents that optimize on-line the parameters of a standard non-Self-Adaptive MCTS agent of GVGAI. The three agents select the parameter values using Naive Monte-Carlo, an Evolutionary Algorithm and an N-Tuple Bandit Evolutionary Algorithm respectively, and are tested on 20 single-player games of GVGAI. The SA-MCTS agents achieve more robust results on the tested games. With the same time setting, they perform similarly to the baseline standard MCTS agent in the games for which the baseline agent performs well, and significantly improve the win rate in the games for which the baseline agent performs poorly. As validation, we also test the performance of non-Self-Adaptive MCTS instances that use the most sampled parameter settings during the on-line tuning of each of the three SA-MCTS agents for each game. Results show that these parameter settings improve the win rate on the games Wait for Breakfast and Escape by 4 times and 150 times, respectively.
引用
收藏
页码:358 / 375
页数:18
相关论文
共 50 条
  • [41] SOTA: Towards a General Model for Self-Adaptive Systems
    Abeywickrama, Dhaminda B.
    Bicocchi, Nicola
    Zambonelli, Franco
    2012 IEEE 21ST INTERNATIONAL WORKSHOP ON ENABLING TECHNOLOGIES: INFRASTRUCTURE FOR COLLABORATIVE ENTERPRISES (WETICE), 2012, : 48 - 53
  • [42] Procedural Level Generation with Answer Set Programming for General Video Game Playing
    Neufeld, Xenija
    Mostaghim, Sanaz
    Perez-Liebana, Diego
    2015 7TH COMPUTER SCIENCE AND ELECTRONIC ENGINEERING CONFERENCE (CEEC), 2015, : 207 - 212
  • [43] Analysis of Vanilla Rolling Horizon Evolution Parameters in General Video Game Playing
    Gaina, Raluca D.
    Liu, Jialin
    Lucas, Simon M.
    Perez-Liebana, Diego
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2017, PT I, 2017, 10199 : 418 - 434
  • [44] On the Cross-Domain Reusability of Neural Modules for General Video Game Playing
    Braylan, Alex
    Hollenbeck, Mark
    Meyerson, Elliot
    Miikkulainen, Risto
    COMPUTER GAMES, CGW 2015, 2016, 614 : 115 - 129
  • [45] Inductive general game playing
    Cropper, Andrew
    Evans, Richard
    Law, Mark
    MACHINE LEARNING, 2020, 109 (07) : 1393 - 1434
  • [46] Inductive general game playing
    Andrew Cropper
    Richard Evans
    Mark Law
    Machine Learning, 2020, 109 : 1393 - 1434
  • [47] A Novel Video Coding Framework Using a Self-Adaptive Dictionary
    Xue, Yuanyi
    Wang, Yao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (12) : 3478 - 3491
  • [48] Population Seeding Techniques for Rolling Horizon Evolution in General Video Game Playing
    Gaina, Raluca D.
    Lucas, Simon M.
    Perez-Liebana, Diego
    2017 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2017, : 1956 - 1963
  • [49] Multi-Objective Tree Search Approaches for General Video Game Playing
    Perez-Liebana, Diego
    Mostaghim, Sanaz
    Lucas, Simon M.
    2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 624 - 631
  • [50] General Game Playing with Ants
    Sharma, Shiven
    Kobti, Ziad
    Goodwin, Scott
    SIMULATED EVOLUTION AND LEARNING, PROCEEDINGS, 2008, 5361 : 381 - 390