Monotonic Model Improvement Self-play Algorithm for Adversarial Games

被引:0
|
作者
Sundar, Poorna Syama [1 ]
Vasam, Manjunath [1 ]
Joseph, Ajin George [1 ]
机构
[1] Indian Inst Technol Tirupati, Dept Comp Sci & Engn, Tirupati, Andhra Pradesh, India
关键词
D O I
10.1109/CDC49753.2023.10383417
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The problem of solving strategy games has intrigued the scientific community for centuries. In this paper, we consider two-player adversarial zero-sum symmetric games with zero information loss. Here, both players are continuously attempting to make decisions that will change the current game state to his/her advantage and hence the gains of one player are always equal to the losses of the other player. In this paper, we propose a model improvement self-play algorithm, where the agent iteratively switches roles to subdue the current adversary strategy. This monotonic improvement sequence leads to the ultimate development of a monolithic, competent absolute no-loss policy for the game environment. This tactic is the first of its kind in the setting of two-player adversarial games. Our approach could perform competitively and sometimes expertly in games such as 4x4 tic-tac-toe, 5x5 domineering, cram, and dots & boxes with a minimum number of moves.
引用
收藏
页码:5600 / 5605
页数:6
相关论文
共 50 条
  • [1] A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games
    Xiong, Wei
    Zhong, Han
    Shi, Chengshuai
    Shen, Cong
    Zhang, Tong
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [2] Fictitious Self-Play in Extensive-Form Games
    Heinrich, Johannes
    Lanctot, Marc
    Silver, David
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 805 - 813
  • [3] Temporal Induced Self-Play for Stochastic Bayesian Games
    Chen, Weizhe
    Zhou, Zihan
    Wu, Yi
    Fang, Fei
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 96 - 103
  • [4] Guarantees for Self-Play in Multiplayer Games via Polymatrix Decomposability
    MacQueen, Revan
    Wright, James R.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [5] Extracting tactics learned from self-play in general games
    Soemers, Dennis J. N. J.
    Samothrakis, Spyridon
    Piette, Eric
    Stephenson, Matthew
    INFORMATION SCIENCES, 2023, 624 : 277 - 298
  • [6] Self-play reinforcement learning with comprehensive critic in computer games
    Liu, Shanqi
    Cao, Junjie
    Wang, Yujie
    Chen, Wenzhou
    Liu, Yong
    NEUROCOMPUTING, 2021, 449 : 207 - 213
  • [7] Neural Fictitious Self-Play in Imperfect Information Games with Many Players
    Kawamura, Keigo
    Mizukami, Naoki
    Tsuruoka, Yoshimasa
    COMPUTER GAMES (CGW 2017), 2018, 818 : 61 - 74
  • [8] Self-play: Statistical significance
    Haworth, GM
    ICGA JOURNAL, 2003, 26 (02) : 115 - 118
  • [9] Optimal Strategy for Aircraft Pursuit-evasion Games via Self-play Iteration
    Wang, Xin
    Wei, Qing-Lai
    Li, Tao
    Zhang, Jie
    MACHINE INTELLIGENCE RESEARCH, 2024, 21 (03) : 585 - 596
  • [10] A Generalized Framework for Self-Play Training
    Hernandez, Daniel
    Denamganai, Kevin
    Gao, Yuan
    York, Peter
    Devlin, Sam
    Samothrakis, Spyridon
    Walker, James Alfred
    2019 IEEE CONFERENCE ON GAMES (COG), 2019,