Self-adaptive MCTS for General Video Game Playing

被引:14
|
作者
Sironi, Chiara F. [1 ]
Liu, Jialin [2 ]
Perez-Liebana, Diego [2 ]
Gaina, Raluca D. [2 ]
Bravi, Ivan [2 ]
Lucas, Simon M. [2 ]
Winands, Mark H. M. [1 ]
机构
[1] Maastricht Univ, Dept Data Sci & Knowledge Engn, Games & AI Grp, Maastricht, Netherlands
[2] Queen Mary Univ London, Sch Elect Engn & Comp Sci, Game AI Grp, London, England
来源
APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2018 | 2018年 / 10784卷
基金
英国工程与自然科学研究理事会;
关键词
MCTS; On-line tuning; Self-adaptive Robust game playing; General video game playing; GO;
D O I
10.1007/978-3-319-77538-8_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Monte-Carlo Tree Search (MCTS) has shown particular success in General Game Playing (GGP) and General Video Game Playing (GVGP) and many enhancements and variants have been developed. Recently, an on-line adaptive parameter tuning mechanism for MCTS agents has been proposed that almost achieves the same performance as off-line tuning in GGP. In this paper we apply the same approach to GVGP and use the popular General Video Game AI (GVGAI) framework, in which the time allowed to make a decision is only 40ms. We design three Self-Adaptive MCTS (SA-MCTS) agents that optimize on-line the parameters of a standard non-Self-Adaptive MCTS agent of GVGAI. The three agents select the parameter values using Naive Monte-Carlo, an Evolutionary Algorithm and an N-Tuple Bandit Evolutionary Algorithm respectively, and are tested on 20 single-player games of GVGAI. The SA-MCTS agents achieve more robust results on the tested games. With the same time setting, they perform similarly to the baseline standard MCTS agent in the games for which the baseline agent performs well, and significantly improve the win rate in the games for which the baseline agent performs poorly. As validation, we also test the performance of non-Self-Adaptive MCTS instances that use the most sampled parameter settings during the on-line tuning of each of the three SA-MCTS agents for each game. Results show that these parameter settings improve the win rate on the games Wait for Breakfast and Escape by 4 times and 150 times, respectively.
引用
收藏
页码:358 / 375
页数:18
相关论文
共 50 条
  • [31] Reinforcement Learning With Dual-Observation for General Video Game Playing
    Hu, Chengpeng
    Wang, Ziqi
    Shu, Tianye
    Tong, Hao
    Togelius, Julian
    Yao, Xin
    Liu, Jialin
    IEEE TRANSACTIONS ON GAMES, 2023, 15 (02) : 202 - 216
  • [32] Efficient Implementation of Breadth First Search for General Video Game Playing
    Ito, Suguru
    Guo, Zikun
    Chu, Chun Yin
    Harada, Tomohiro
    Thawonmas, Ruck
    2016 IEEE 5TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS, 2016,
  • [33] Shallow Decision-Making Analysis in General Video Game Playing
    Bravi, Ivan
    Perez-Liebana, Diego
    Lucas, Simon M.
    Liu, Jialin
    PROCEEDINGS OF THE 2018 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG'18), 2018, : 1 - 8
  • [34] Self-adaptive SURF for image-to-video matching
    Ming Yang
    Jiaming Li
    Zhigang Li
    Wen Li
    Kairui Zhang
    Signal, Image and Video Processing, 2024, 18 (1) : 751 - 759
  • [35] Video game: Playing with the planet
    Nicola Jones
    Nature Climate Change, 2011, 1 (1) : 17 - 18
  • [36] AGE AND VIDEO GAME PLAYING
    MCCLURE, RF
    PERCEPTUAL AND MOTOR SKILLS, 1985, 61 (01) : 285 - 286
  • [37] Sleep quality and video game playing: Effect of intensity of video game playing and mental health
    Altintas, Emin
    Karaca, Yasemin
    Hullaert, Timothe
    Tassi, Patricia
    PSYCHIATRY RESEARCH, 2019, 273 : 487 - 492
  • [38] On self-adaptive method for general mixed variational inequalities
    Bnouhachem, Abdellah
    Noor, Muhammad Aslam
    Al-Shemas, Eman H.
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2008, 2008
  • [39] A self-adaptive predictive policy for pursuit-evasion game
    Luo, Zhen
    Cao, Qi-Xin
    Zhao, Yan-Zheng
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2008, 24 (05) : 1397 - 1407
  • [40] Self-adaptive projection algorithms for general variational inequalities
    Noor, MA
    Noor, KI
    APPLIED MATHEMATICS AND COMPUTATION, 2004, 151 (03) : 659 - 670