Imitation Learning for Playing Shogi Based on Generative Adversarial Networks

被引:0
|
作者
Wan, Shanchuan [1 ]
Kaneko, Tomoyuki [2 ,3 ]
机构
[1] Univ Tokyo, Grad Sch Interdisciplinary Informat Studies, Tokyo, Japan
[2] Univ Tokyo, Interfac Initiat Informat Studies, Tokyo, Japan
[3] JST, PRESTO, Kawaguchi, Saitama, Japan
关键词
imitation learning; board games; computer shogi; neural networks;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For imitation learning in games, AI programs commonly learn thinking and evaluating methods from professional players' game records. However, compared to the total number of all possible game states, top players' records are extremely insufficient. The limited amount of high -quality learning materials may become the bottleneck of training artificial intelligence. We proposed to introduce the idea of Generative Adversarial Networks into game programming, and validated its effectiveness in playing Shogi, a Japanese Chess game. The proposed method is experimentally proved to be capable to alleviate the data insufficiency problem and build more competitive AI programs than conventional supervised training methods.
引用
收藏
页码:92 / 95
页数:4
相关论文
共 50 条
  • [31] MusicGAIL: A Generative Adversarial Imitation Learning Approach for Music Generation
    Liao, Yusong
    Xu, Hongguang
    Xu, Ke
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT I, 2024, 14473 : 505 - 516
  • [32] Bregman Learning for Generative Adversarial Networks
    Gao, Jian
    Tembine, Hamidou
    PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 82 - 89
  • [33] Collaborative Learning of Generative Adversarial Networks
    Tsukahara, Takuya
    Hirakawa, Tsubasa
    Yamashita, Takayoshi
    Fujiyoshi, Hironobu
    VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, : 492 - 499
  • [34] Camera view planning based on generative adversarial imitation learning in indoor active exploration
    Dai, Xu-Yang
    Meng, Qing-Hao
    Jin, Sheng
    Liu, Yin -Bo
    APPLIED SOFT COMPUTING, 2022, 129
  • [35] Dynamic economic dispatch of integrated energy system based on generative adversarial imitation learning
    Zhang, Wei
    Shi, Jianhang
    Wang, Junyu
    Jiang, Yan
    ENERGY REPORTS, 2024, 11 : 5733 - 5743
  • [36] Combining Model-Based Controllers and Generative Adversarial Imitation Learning for Traffic Simulation
    Chen, Haonan
    Ji, Tianchen
    Liu, Shuijing
    Driggs-Campbell, Katherine
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 1698 - 1704
  • [37] Optimal Energy Dispatch for Integrated Energy Systems Based on Generative Adversarial Imitation Learning
    Shi, Yiru
    Zhang, Dahai
    Li, Lixin
    Li, Yaping
    Yun, Yunyun
    Sun, Kai
    Gaodianya Jishu/High Voltage Engineering, 2024, 50 (08): : 3535 - 3544
  • [38] Rule Injection-Based Generative Adversarial Imitation Learning for Knowledge Graph Reasoning
    Wang, Sheng
    Chen, Xiaoyin
    Xiong, Shengwu
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT III, 2021, 12714 : 338 - 350
  • [39] Crack Detection Based on Generative Adversarial Networks and Deep Learning
    Chen, Gongfa
    Teng, Shuai
    Lin, Mansheng
    Yang, Xiaomei
    Sun, Xiaoli
    KSCE JOURNAL OF CIVIL ENGINEERING, 2022, 26 (04) : 1803 - 1816
  • [40] Ensemble-Based Distributed Learning for Generative Adversarial Networks
    Liu, Chonghe
    Ren, Jinke
    Yu, Guanding
    2022 IEEE 95TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-SPRING), 2022,