Efficiently Mastering the Game of NoGo with Deep Reinforcement Learning Supported by Domain Knowledge

被引:3
|
作者
Gao, Yifan [1 ]
Wu, Lezhou [2 ]
机构
[1] Northeastern Univ, Coll Med & Biol Informat Engn, Shenyang 110819, Liaoning, Peoples R China
[2] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Liaoning, Peoples R China
关键词
artificial intelligence; deep learning; AlphaZero; NoGo games; reinforcement learning; GO;
D O I
10.3390/electronics10131533
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Computer games have been regarded as an important field of artificial intelligence (AI) for a long time. The AlphaZero structure has been successful in the game of Go, beating the top professional human players and becoming the baseline method in computer games. However, the AlphaZero training process requires tremendous computing resources, imposing additional difficulties for the AlphaZero-based AI. In this paper, we propose NoGoZero+ to improve the AlphaZero process and apply it to a game similar to Go, NoGo. NoGoZero+ employs several innovative features to improve training speed and performance, and most improvement strategies can be transferred to other nonspecific areas. This paper compares it with the original AlphaZero process, and results show that NoGoZero+ increases the training speed to about six times that of the original AlphaZero process. Moreover, in the experiment, our agent beat the original AlphaZero agent with a score of 81:19 after only being trained by 20,000 self-play games' data (small in quantity compared with 120,000 self-play games' data consumed by the original AlphaZero). The NoGo game program based on NoGoZero+ was the runner-up in the 2020 China Computer Game Championship (CCGC) with limited resources, defeating many AlphaZero-based programs. Our code, pretrained models, and self-play datasets are publicly available. The ultimate goal of this paper is to provide exploratory insights and mature auxiliary tools to enable AI researchers and computer-game communities to study, test, and improve these promising state-of-the-art methods at a much lower cost of computing resources.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Comparison of Deep Reinforcement Learning Approaches for Intelligent Game Playing
    Jeerige, Anoop
    Bein, Doina
    Verma, Abhishek
    [J]. 2019 IEEE 9TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2019, : 366 - 371
  • [42] RARSMSDou: Master the Game of DouDiZhu With Deep Reinforcement Learning Algorithms
    Luo, Qian
    Tan, Tien-Ping
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (01): : 427 - 439
  • [43] A Deep Reinforcement Learning Based Offloading Game in Edge Computing
    Zhan, Yufeng
    Guo, Song
    Li, Peng
    Zhang, Jiang
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2020, 69 (06) : 883 - 893
  • [44] A Deep Reinforcement Learning-Based Approach in Porker Game
    Kong, Yan
    Rui, Yefeng
    Hsia, Chih-Hsien
    [J]. Journal of Computers (Taiwan), 2023, 34 (02) : 41 - 51
  • [45] Systematic choice of video game benchmarks in Deep Reinforcement Learning
    Gomes, Elvio
    Souza, Marlo
    [J]. 2021 20TH BRAZILIAN SYMPOSIUM ON COMPUTER GAMES AND DIGITAL ENTERTAINMENT (SBGAMES 2021), 2021, : 162 - 171
  • [46] Deep Reinforcement Learning Methods in Match-3 Game
    Kamaldinov, Ildar
    Makarov, Ilya
    [J]. ANALYSIS OF IMAGES, SOCIAL NETWORKS AND TEXTS, AIST 2019, 2019, 11832 : 51 - 62
  • [47] Combining Deep Reinforcement Learning with Prior Knowledge and Reasoning
    Bougie, Nicolas
    Cheng, Li Kai
    Ichise, Ryutaro
    [J]. APPLIED COMPUTING REVIEW, 2018, 18 (02): : 33 - 45
  • [48] Integrating Domain-Knowledge into Deep Learning
    Salakhutdinov, Ruslan
    [J]. KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 3176 - 3176
  • [49] Using Deep Learning to Replace Domain Knowledge
    Luebben, Christian
    Pahl, Marc-Oliver
    Khan, Mohammad Irfan
    [J]. 2020 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2020, : 423 - 428
  • [50] Improving Deep Reinforcement Learning-Based Perimeter Metering Control Methods With Domain Control Knowledge
    Zhou, Dongqin
    Gayah, Vikash V. V.
    [J]. TRANSPORTATION RESEARCH RECORD, 2023, 2677 (07) : 384 - 405