Self-adaptive MCTS for General Video Game Playing

被引：14

作者：

Sironi, Chiara F. ^{[1
]}

Liu, Jialin ^{[2
]}

Perez-Liebana, Diego ^{[2
]}

Gaina, Raluca D. ^{[2
]}

Bravi, Ivan ^{[2
]}

Lucas, Simon M. ^{[2
]}

Winands, Mark H. M. ^{[1
]}

机构：

[1] Maastricht Univ, Dept Data Sci & Knowledge Engn, Games & AI Grp, Maastricht, Netherlands

[2] Queen Mary Univ London, Sch Elect Engn & Comp Sci, Game AI Grp, London, England

来源：

APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2018 | 2018年 / 10784卷

基金：

英国工程与自然科学研究理事会;

关键词：

MCTS; On-line tuning; Self-adaptive Robust game playing; General video game playing; GO;

D O I：

10.1007/978-3-319-77538-8_25

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Monte-Carlo Tree Search (MCTS) has shown particular success in General Game Playing (GGP) and General Video Game Playing (GVGP) and many enhancements and variants have been developed. Recently, an on-line adaptive parameter tuning mechanism for MCTS agents has been proposed that almost achieves the same performance as off-line tuning in GGP. In this paper we apply the same approach to GVGP and use the popular General Video Game AI (GVGAI) framework, in which the time allowed to make a decision is only 40ms. We design three Self-Adaptive MCTS (SA-MCTS) agents that optimize on-line the parameters of a standard non-Self-Adaptive MCTS agent of GVGAI. The three agents select the parameter values using Naive Monte-Carlo, an Evolutionary Algorithm and an N-Tuple Bandit Evolutionary Algorithm respectively, and are tested on 20 single-player games of GVGAI. The SA-MCTS agents achieve more robust results on the tested games. With the same time setting, they perform similarly to the baseline standard MCTS agent in the games for which the baseline agent performs well, and significantly improve the win rate in the games for which the baseline agent performs poorly. As validation, we also test the performance of non-Self-Adaptive MCTS instances that use the most sampled parameter settings during the on-line tuning of each of the three SA-MCTS agents for each game. Results show that these parameter settings improve the win rate on the games Wait for Breakfast and Escape by 4 times and 150 times, respectively.

引用

页码：358 / 375

页数：18

共 50 条

[41] SOTA: Towards a General Model for Self-Adaptive Systems
Abeywickrama, Dhaminda B.
Bicocchi, Nicola
Zambonelli, Franco
2012 IEEE 21ST INTERNATIONAL WORKSHOP ON ENABLING TECHNOLOGIES: INFRASTRUCTURE FOR COLLABORATIVE ENTERPRISES (WETICE), 2012, : 48 - 53
[42] Procedural Level Generation with Answer Set Programming for General Video Game Playing
Neufeld, Xenija
Mostaghim, Sanaz
Perez-Liebana, Diego
2015 7TH COMPUTER SCIENCE AND ELECTRONIC ENGINEERING CONFERENCE (CEEC), 2015, : 207 - 212
[43] Analysis of Vanilla Rolling Horizon Evolution Parameters in General Video Game Playing
Gaina, Raluca D.
Liu, Jialin
Lucas, Simon M.
Perez-Liebana, Diego
APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2017, PT I, 2017, 10199 : 418 - 434
[44] On the Cross-Domain Reusability of Neural Modules for General Video Game Playing
Braylan, Alex
Hollenbeck, Mark
Meyerson, Elliot
Miikkulainen, Risto
COMPUTER GAMES, CGW 2015, 2016, 614 : 115 - 129
[45] Inductive general game playing
Cropper, Andrew
Evans, Richard
Law, Mark
MACHINE LEARNING, 2020, 109 (07) : 1393 - 1434
[46] Inductive general game playing
Andrew Cropper
Richard Evans
Mark Law
Machine Learning, 2020, 109 : 1393 - 1434
[47] A Novel Video Coding Framework Using a Self-Adaptive Dictionary
Xue, Yuanyi
Wang, Yao
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (12) : 3478 - 3491
[48] Population Seeding Techniques for Rolling Horizon Evolution in General Video Game Playing
Gaina, Raluca D.
Lucas, Simon M.
Perez-Liebana, Diego
2017 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2017, : 1956 - 1963
[49] Multi-Objective Tree Search Approaches for General Video Game Playing
Perez-Liebana, Diego
Mostaghim, Sanaz
Lucas, Simon M.
2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 624 - 631
[50] General Game Playing with Ants
Sharma, Shiven
Kobti, Ziad
Goodwin, Scott
SIMULATED EVOLUTION AND LEARNING, PROCEEDINGS, 2008, 5361 : 381 - 390

← 1 2 3 4 5 →